Under the editorship of 
M. F. Nimkoff

The Florida State University




Statistical Reasoning in Sociology 


John H. Mueller 
Karl F. Schuessler
Indiana University



OXFORD & IBH PUBLISHING CO. 

Calcutta Bombay New Delhi 



Copyright © 1961, By John H. Mueller
and Karl F. Schuessler


All rights reserved, including the right to reproduce this
book or parts thereof in any form


Indian Edition 1969 Published by arrangement with the
original publishers Houghton Mifflin
Company, Boston, U.S.A.


Price : Rs. 12.75


For sale in India, Pakistan, Burma, Thailand,
Ceylon and Indonesia.


This book has been published with the assistance of the
Joint Indian-American Textbook Programme



Preface 


Scope of Statistics. Conventionally, the field of statistics is broken down
into two divisions: (1) descriptive statistics and (2) inferential statistics.
Descriptive statistics deals with the condensation of large masses of data
into frequency distributions, measures of location, dispersion, and correla-
tion, and other more complicated measures such as weighted indexes,
standardized rates, and factor loadings. But since we often find it incon-
venient or impossible to assemble completely the data which are the object
of our interest and are obliged to obtain our information from a mere
fragment, usually called a sample, we are compelled to infer from the
fragmentary sample the probable description of the unseen population.
This inductive process is not a simple task, and elaborate techniques have
therefore been devised to render this process dependable. These have been
labeled inferential statistics. Inferential statistics, therefore, take their
departure from the sampling of an unknown universe, which the sample
will retroactively describe with varying degrees of reliability. Hence,
the process and techniques of sampling beget problems of reliability, con-
fidence, and decision-making.

Now, although the fundamental problems of descriptive statistics are
only incompletely solved, still they are currently not so challenging to
the inventive impulses of statisticians and mathematicians as are the prob-
lems of sampling and decision-making, which at present appear to be on
the growing edge of the discipline of statistics. During recent decades,
sampling and its associated issues have been the center of exploratory
interest and have pre-empted the energies of creative scholars in mathe-
matics and statistics. This concentration of interest on the fascinating
frontier of statistical knowledge has been reflected in the content of some
recent elementary textbooks which have even gone so far as to define the
field of statistics restrictively in such phrases as “decision-making in the
face of uncertainty.” To such theorists, the definition of “ordering
quantitative data” would perhaps seem hopelessly old fashioned.





Published by Oxford & IBH Publishing Co., Oxford Bldg.,
N-88, Connaught Circus, New Delhi-1, and printed
by R.N. Sachdev, at Allied Publishers Private Ltd.
(Printing Division) 4, Najafgarh Road,
New Delhi-15.





common, for example, to find students embarking on partial correlations
who seem unaware of the possibilities in cross-tabulation and subclassifica-
tion. The exposure of inner relations does not always require such com-
plicated manipulation; indeed, many of the complex devices conceal what
relatively simpler devices may reveal.

The more leisurely and intense cultivation of fundamentals, which is
one of the objectives of this text, may appear to decelerate the pace of
progress of the student. In exchange, however, he will absorb a more
mature and flexible understanding of statistical knowledge and a finer
feeling for quantitative reasoning, as well as enhance his comprehension of
sociological processes. Impatience to do “research” without such an
adequate basic repertoire of tools and procedures only leads to immature
applications, mechanical cookbook routines, limited and distorted compre-
hension of what you have when you have it, and inadequate reservations in
interpretation.

Furthermore, many of the social data are of such nature and composition
that they cannot profit by complex treatment and analysis, nor do they
even deserve to be dressed in the trappings of recondite formulas: shabby
data may take on a superficially scholarly hue and a semblance of precision
which constitutes scientism of a most unworthy sort.

Special Emphases. In conformity with the thought of assigning a some-
what higher status to descriptive procedures than seems to be current
custom, we have included treatment of certain topics which are usually
omitted. Among the more important and useful are standardization and
norming operations. Qualitative variables, in contradistinction to quan-
titative variables, are given a somewhat more prominent place than is
their usual lot, reflecting the currency of such data in sociology. Corre-
sponding to this emphasis, the characteristics of attributes, their tabulation,
and measures of their variation and correlation are analyzed in a somewhat
extended manner.

The forms of correlation are presented in the reverse of the usual order. 
Instead of initiating the topic of correlation with the traditional Pearsonian 
product-moment method and linear prediction, the subject of relationship 
is launched with the simpler measures of association between dichotomous 
variables: Q and φ. The student is thereby inducted much more gently
into the field of relationships; only when the concepts of joint occurrence,
covariation, and marginal distribution are firmly established is the more
complex Pearsonian correlation introduced. The approach to covariation
is made via the principle of explained variation, which renders it at once
amenable to both linear and curvilinear relations and is thus more flexible
in its sociological application.

Perhaps the most significant deviation from the usual repertoire of 
techniques is the omission of the traditional discussion of small samples



Consistent with this conception of statistics, such authors have opened
their discussion with sampling. But not being able to ignore the descriptive
aspects of statistics, they interrupt the continuity with short digressions
on the rudiments of descriptive statistics, after which they again pick up 
the threads of inferential statistics. 

It is our contention, however, that the definition of statistics exclusively 
in terms of inference and decision-making confuses the part with the 
whole; that is, the segment which happens to be in the focus of current
methodological interest with the entire field of statistical interest. What
is often labeled “modern” statistics has, however, not displaced descriptive
statistics, but merely extended it into more advanced regions. In spite
of the current trends, the aim of statistics is not exhausted in sampling;
the sample still serves as a vehicle for the ultimate description of the uni-
verse.

It would therefore appear that the present pedagogical fashion im-
prudently relegates descriptive statistics to a position of secondary impor-
tance, and conveys the impression that descriptive techniques can be
casually acquired. But many of the fundamentals of statistical reasoning
are concentrated in descriptive procedures and are too useful in their own
right to be employed merely as stepping stones to inferential procedures.
Indeed, the direction of service is the reverse: the sample serves the cause
of the universe. We should therefore first know what we mean by certain
descriptive characteristics before making inferences about them from the
sample. It is inefficient and confusing to estimate the value of a population
characteristic before the meaning of that characteristic is perfectly clear
in all of its statistical, and substantive, implications. The student
will simply not get the feel for the mean and the standard deviation, for
example, when such concepts are accorded only incidental and passing
treatment.

In this text, therefore, descriptive statistics are first presented separately, 
uncomplicated by reference to sampling procedures, which are sufficiently
difficult to acquire even after descriptive statistics have become second
nature. We think that the beginning student will appreciate such con-
sideration. 

The premature emphasis on inferential, to the neglect of descriptive,
statistics has the further unfortunate side effect of an overhasty adoption
of complex measures and formulas which are only vaguely visualized, and
certainly not adequately appreciated. There seems to prevail among
many students, who have acquired some facility in the middle range of
statistical training, a tendency to use their most “advanced” knowledge,
when more thorough probing of simpler devices would be more effective
and meaningful. This inclination to embrace complicated models, which
are often much less effective than more elementary methods in displaying
the data, actually inhibits sound quantitative reasoning. It is not un-



machines, without the intimate familiarity in miniature that is acquired by
calculation, would lead only to routinistic, mechanical, and robot
statistics.

Machine calculation, it is therefore contended, has little place in the
elementary course which emphasizes fundamentals. Not only are the
problems not so elaborate as to require a machine, but the entire emphasis
should be on the acquisition of principles and the encouragement of quan-
titative reasoning, uncluttered by excessive arithmetic calculations which
add nothing to statistical theory. In fact, complicated machine calcula-
tions will only becloud the internal issues for the immature worker. Com-
puting outfits are no more meticulous than sausage machines in clarifying
the materials that pass through them, unless a knowledgeable person
selects the ingredients, orders them, and creates proper mixtures.

Mathematical Prerequisites. This text joins the increasing number of those
that do not make demands beyond the ordinary high school mathematical
background. Indeed, such a reassurance has become so standard in the
textbook blurbs that it has almost attained the status of a cliche. The
apologetic reason often given for this academic practice is simply that
the college student does not possess mathematical competence beyond high
school algebra anyway, and we are therefore forced to adjust ourselves to
the deplorable reality.

It is unquestionably true that students are able to enter the university
with inadequate competence even in arithmetic, to say nothing of algebra
and trigonometry. However, it is not a question of surrendering to the
laxities of high school education, for there are more constructive reasons
for the alleged disregard of mathematical requirements in elementary
statistics. It seems to the present authors, as well as to others, that much
statistical reasoning may actually be competently carried on with only a
good arithmetic background. Furthermore, many mathematical formulas
may be put to adequate and profitable use without a comprehension of the
technical means by which these formulas are derived. Consequently, if the
formulas are not to be derived anyway, higher mathematical training is
less essential to their implementation. In fact, a considerable degree of
statistical skill can be developed on, at most, a base of elementary algebra,
at least during the first, and possibly second, year of academic train-
ing. Such a concession does not argue affiliation with the “statistics-made-
easy” school of thought. Formal skills must be mastered if they are to be
imaginatively applied to social data. On the other hand, the mechan-
ically flawless application of such procedures without regard for their
relevance to subject matter can readily lead to nonsensical and deceptive
conclusions. It is the aim of this book to cultivate the art of statistical
reasoning by focusing on the union of neutral arithmetic routine and
concrete subject matter.





and “Student’s” t. This was deliberate, although such a decision is recog-
nized as harboring some reservation. Small-sample theory may be viewed
as an amendment to the techniques of sampling procedure rather than as
a new body of theory; and it has been the principle of selection for this text
not to enter into the manifold variants of standard procedures, but rather
to concentrate on the thorough understanding of fundamentals within the 
scope of an elementary framework. 

Mathematical notation has been employed as sparingly as possible. In- 
structors are not always fully aware of the frustration experienced by the 
students in acquiring a new “alphabet.” Symbols are never introduced in
anticipation of their later use, but only when actually required.

By far, most of the data employed in illustrating statistical processes 
are taken from sociological and related sources. It is, of course, obvious 
that such meaningful materials elicit and reinforce student interest, which 
might otherwise tend to lag. However, in certain instances, hypothetical 
data of a very simplified sort have been introduced for experimental 
demonstrations of the behavior of data because they are so easily ma-
neuvered, and more clearly expose the procedures under examination.
Such imaginary data actually become more meaningful than the collected
empirical materials, which are often too cumbersome for the purpose.

In statistics, as in other disciplines, a certain amount of pedagogical 
energy is devoted to mere terminological problems. This is inevitable, since
the only channel of communication is terminology. Not until an author
has examined several hundred standard references, in the process of in-
creasing that number by one, does he realize the variation in the use of
concepts. Fortunately, much of the controversy on terminology is merely
terminological and does not alter one whit the statistical practice. We
have attempted to codify what seems to be good practice, but have veered
therefrom at times for purposes of consistency. Occasionally such devia-
tions have been briefly noted. 

Machine Calculation. An impression prevails in some quarters that desk
calculators and electronic computers are tending to displace pencil-and-
paper statistics, as cultivated in such a text as the one before us. No
greater self-deception is conceivable than the one that machines have
rendered obsolete the old-fashioned virtues of hand calculation. The type-
writer and printing press have not driven out of existence the slower
methods of penmanship; pencil and paper have not become vestigial tools.
There are several reasons for the insistence on pencil-and-paper methods
in this text. Many students, as well as many workers in the field, do not
have electronic computers available, or even desk calculators. Nor are
most of the problems encountered in ordinary affairs sufficiently elaborate
to require such expensive apparatus. Even if electronic machines were the
ultimate answer to all problems, it would still be true that the use of





Acknowledgments. While every book is truly a group effort, it is also true
that some individuals contribute more tangibly than others. Foremost
among the contributors to this book are the many students whose thought-
ful reactions enabled us to see more clearly what issues are most trouble-
some to the beginner. Also high in our debt are those graduate assistants
in social statistics whose critical examination of study problems led in
many instances to revisions of those expository portions of the text which
appeared to be wanting in clarity or completeness. In particular, it is a
pleasure to thank Cherry Carter, Herbert Costner, and Donald Horning,
who attended to such a large part of the numerical detail that necessarily
enters into a statistical work but which is often inconspicuous in the final
product. We are especially indebted to Professor Peter Rossi, whose
critical but sympathetic reading of the entire manuscript issued in numer-
ous constructive suggestions.

We are indebted to Professor Sir Ronald Fisher, F.R.S., Cambridge, 
and to Dr. Frank Yates, F.R.S., Rothamsted, also to Messrs. Oliver & 
Boyd Ltd., Edinburgh, for permission to reprint Table IV from their
book Statistical Tables for Biological, Agricultural and Medical Research.
Finally, to the following publishers we are obliged for their permission to
reproduce copyrighted materials: The American Journal of Sociology, The
American Sociological Review, Chapman and Hall Ltd., Columbia University
Press, Educational and Psychological Measurement, Harper & Brothers,
McGraw-Hill Book Co., Inc., Metropolitan Life Insurance Co., Prentice- 
Hall, Inc., Princeton University Press, Simon and Schuster, Transactions 
of the Society of Actuaries, the University of Chicago Press, John Wiley & 
Sons, Inc., and Yale University Press. 


John H. Mueller
Karl F. Schuessler


Bloomington, Indiana 
November, 1960 





Sociology and Statistics. This text has been prepared primarily for soci-
ologists, since other disciplines will necessarily select different elements.
Furthermore, sociology has lagged behind economics, psychology, and
education in its cultivation of statistics. This unfortunate lag expresses
itself in the more or less independent development of statistics outside the
scope of the basic courses in the department usually offered on the lower
levels and even in graduate divisions. Unlike economics and psychology,
the introductory courses in sociology assume very little in the way of
statistical knowledge or training. Indeed, in many instances, the first
elements of statistics are not even embarked upon until the student has
attained graduate status. This text seeks to break down that compart-
mentalization by gearing the presentation to the undergraduate, with
courses on middle levels primarily in mind; but it is hoped that it will
contribute to the cultivation of quantitative methods among students of
all ranks, including, of course, the graduate student without previous
statistical training.

It is recognized that many students in sociology are not destined to
become professional workers in the field. By far, most of them will become 
consumers, and not producers, of statistical materials. We therefore be- 
lieve that the mastery of introductory statistical techniques and the com- 
prehension of the principles involved will carry with it cultural benefits 
similar to those gained in the pursuit of substantive courses. 

And even professionally, many graduates will discover that descriptive
statistics are sufficient for their purposes and that inferential statistics
are only rarely called into play. Their primary need will therefore be for
those skills which enable them to “move” their field observations onto
the printed page by compiling them and presenting them in a manner
appropriate to their purposes and faithful to the nature of the events them-
selves; in short, in such a way that the statistics do not “lie.” Hence,
the beginning student should not be deluged by a confusing number of
variants of the same procedure, but should be inducted into a limited num-
ber of fundamental methods, which should be accompanied by rational an-
alysis to ensure competent application to social data. Consistent with this
point of view, the references at the close of each chapter are appended, not
as a pool of highly technical information, but rather as a source of greater
enlightenment on the role of statistics in social life and social science.

This book had its inception and development in the experience of teach-
ing statistics to undergraduate and graduate students in sociology and the
allied social sciences over a period of years. Many of the questions treated
in this text in a somewhat expansive manner took their origin in the queries
of the more reflective students. The teacher will, of course, proceed at
his own pace, mindful of the stated objectives of the text and also of the
ability of his students.




Contents


Chapter 1 Statistics in Social Life

Chapter 2 Social Variables and Their Measurement
§2.1 Fundamentals of Measurement
§2.2 Errors in Measurement
§2.3 Rounded Measures

Chapter 3 Grouping of Data
§3.1 Quantitative Variables
§3.2 Qualitative Variables
§3.3 Cross-Classification
§3.4 Time Series

Chapter 4 Graphic Presentation
§4.1 The Histogram and the Frequency Polygon
§4.2 The Cumulative Frequency Polygon
§4.3 Graphs of Qualitative Data
§4.4 Time Graphs

Chapter 5 Averages
§5.1 The Mode
§5.2 The Median, or Position Average
§5.3 The Mean, or Arithmetic Average
§5.4 Criteria for Choice of Average



We are pleased to acknowledge the help of those readers
who called our attention to typographical and computa-
tional errors in the first printing of this book. Although
such mistakes were not numerous, we have taken ad-
vantage of this second printing to correct them, and we
are confident that students will accordingly be less distracted
by minor flaws.


John H. Mueller
Karl F. Schuessler



Chapter 12 Statistical Inference
§12.1 Parameter Estimation
§12.2 Testing a Statistical Hypothesis

Appendix
Index of Names
Index of Subjects



Chapter 6 Variation
§6.1 Measures of Spread
§6.2 Variation as Measured by Arithmetic Deviations
§6.3 Variation as Measured by Squared Deviations
§6.4 The Normal Distribution as a Pattern of Variation
§6.5 Variation of Qualitative Variables

Chapter 7 Norming Operations
§7.1 Elementary Norming
§7.2 Subclassification and Standardization

Chapter 8 Probability
§8.1 The Nature of Probability
§8.2 Probability Calculation
§8.3 Combinations and Permutations
§8.4 Binomial Probabilities

Chapter 9 Measurement of Association: Qualitative Variables
§9.1 Concept of Statistical Association
§9.2 Coefficient of Association: Yule's Q
§9.3 The Phi Coefficient
§9.4 Coefficient of Contingency

Chapter 10 Covariation: Quantitative Variables
§10.1 Correlation of Ranks: Rho
§10.2 Scatter Diagram and Correlation Table
§10.3 Linear Correlation of Two Variables
§10.4 The Correlation Ratio
§10.5 Some General Guides to the Interpretation of Correlation

Chapter 11 Sampling



Statistical Reasoning in Sociology 



Statistics in Social Life 


The Individual and the Collective. If social events, such as marriages,
births and deaths, crimes and delinquencies, or public opinions were no
more numerous than chairs in the living room or children in the family,
there would be no great difficulty in apprehending them. In fact, there
would be relatively little need to generalize about them, for the human
intellect could encompass them separately and individually. But when,
instead of a small number of easily identifiable objects, we are called
upon to treat large aggregates, individualization becomes mentally im-
possible. Consequently we can view them only collectively, in terms of
one or more common characteristics, and with corresponding incomplete-
ness. The shepherd of Biblical antiquity “who calleth his own sheep by
name” must have had small flocks indeed. Today, on the Australian or
American ranch, the large flock would be counted, classified, and sum-
marized in various ways, in the course of which the individuals would lose
their identity in the total anonymous mass. So inadequate has the indi-
vidualistic approach become, that, for purposes of description or predic-
tion, various devices of summarization have been evolved; or a limited
number of individuals (a sample) are selected from the aggregate to repre-
sent the total, about which generalizations may be made with varying
degrees of precision. Such economical numerical procedures have, of
course, been in use for centuries; but during the last two hundred years,
these procedures have become formalized and have come to be known
as “statistics.”

This evolution in the history of culture from naming to counting was
neither a product of arbitrary and spontaneous personal choice nor a
private accomplishment. It was rather a matter of gradual compulsive
social adaptation to the exigencies of an expanding social organization
which had become increasingly immense and complex, in which symbols






in the mid-twentieth century by the National Safety Council. Quételet
feelingly comments on his epoch-making discoveries:

Thus we pass from one year to another . . . seeing the same crimes repro-
duced in the same order, and calling down the same punishments. . . .
We might enumerate in advance . . . how many will be forgers, how many
will be prisoners; almost we can enumerate in advance the births and deaths
that should occur.*

In mid-century, Henry Thomas Buckle (1821-1862), the English social
historian, caused similar consternation and moral indignation among his 
incredulous pious readers by calling attention to the fact that, year after 
year, about 250 persons committed suicide. After pointing out that, among 
public crimes, none seems to be so completely dependent on individual 
impulse as suicide, he still observed that 

it is surely an astonishing fact that all the evidence we possess leaves no
doubt . . . that suicides are merely the product of a general condition of
society, and that the individual felon carries into effect what is a necessary
consequence of preceding circumstances. In a given state of society a certain
number of persons must put an end to their own life.†

There has also been a secular version to these objections. Although
such opposition has by now been almost dissipated, it was once thought
that social science should not hope to emulate the mechanical methods
of the physical sciences by treating social behavior quantitatively. It
should rather restrict itself to the operations of mental phenomena such
as understanding, insight, empathy, sympathetic introspection, and
other subjective techniques that the German theorists have subsumed
under Geisteswissenschaft. Such intellectual methods, it is averred, are
appropriate to the true nature of society, which has its being in subjective
communication. Statistics, it is still thought, somehow debases our ob-
servations and dehumanizes society. It is an unnatural imposition of
external techniques upon social realities, which exist in the cultural im-
agination and are therefore essentially unquantifiable. Statistics squeezes
human behavior into alien categories, and is an illegitimate rival to other
methods which recognize the true nature of humanity. Strangely enough,
many persons who casually assert that “most people” “usually” behave
in such and such a manner, and who predict that the “chances are” so
and so, are still offended at the thought of pinning down human behavior
to quantitative terms.

Further “dangers and fallacies” of statistical method have been be-
labored by its critics. It is said that statistics can offer only probabilities, 

* F. H. Hankins, Adolphe Quetelet as Statistician, New York, 1908, p. 83.
† John Venn, Logic of Chance, London, 1876, p. 467.




became a substitute for the individual object itself. Statistics, therefore, 
is an instrument for viewing the mass of happenings around us. To be 
sure, in our private life we are still concerned with personal opinions,
marriages, deaths, and many individual transactions. But in our social,
collective relations, we are concerned with public opinion polls, popula-
tion and vital statistics, actuarial descriptions, and predictions, on the
basis of which society pursues its collective interests. 

To accomplish these ends, it has been necessary to develop a number 
of quantitative techniques which serve not only to describe coherently 
the collectivity of life around us, but also to anticipate, or forecast, their
recurrences in the future, which we express in terms of degrees of proba-
bility.

The concept of statistics may therefore be defined in two related senses:
(1) the factual data themselves, such as vital statistics, statistics on trade,
production, and the like; and (2) the methods, theories, and techniques
by means of which the collected descriptions are summarized and inter-
preted. Although the procedures of statistics, as they have been elabo-
rated and formalized, may sometimes appear to the uninitiated as recon-
dite, and remote from social interests, they are in fact merely an extension
of what every intelligent person does anyway. They have become a
personal and social necessity, reflecting our need for understanding the
past and anticipating the future, as they involve the treatment of large
masses of phenomena. Hence, the field of statistics has sometimes been
characterized as the “science of large numbers,” in which large volumes
of data are systematically compressed into smaller and more manageable
compass.
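
The compression of a mass of observations into a more manageable compass can be sketched in a few lines of Python. This is only an illustrative sketch, not a procedure taken from the text: the ages at marriage are invented, the helper name frequency_distribution is hypothetical, and the class width of five years is an arbitrary assumption.

```python
from collections import Counter
from statistics import mean


def frequency_distribution(values, width):
    """Group raw values into class intervals of the given width.

    Returns a dict mapping each interval's lower limit to the number of
    observations falling in [lower, lower + width).
    """
    counts = Counter((v // width) * width for v in values)
    return dict(sorted(counts.items()))


# Hypothetical ages at marriage, invented purely for illustration.
ages = [19, 22, 23, 23, 24, 25, 25, 26, 28, 31, 34, 41]

# Twelve separate observations are reduced to a handful of class counts.
table = frequency_distribution(ages, width=5)
for lower, count in table.items():
    print(f"{lower}-{lower + 4}: {count}")

# A single average summarizes the whole collection.
print("mean age:", mean(ages))
```

The individual observations lose their identity in the table, which is the price of the summary: the twelve cases are replaced by five class frequencies and one average.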

Validity of the Statistical Method. The statistical method is not only a set
of practicable techniques, but it is also an ideology which validates the
use and application of these techniques. There are, of course, other views
of nature besides the statistical, as is evidenced by the opposition which
this method has aroused in some quarters during its history, especially
when applied to human behavior. By some moralists, for example, the
endeavor to reduce human behavior to statistical regularities has been
condemned as materialistic and as a denial of the axiomatic free will of
man. To set forth the quantitative regularities of inanimate nature is
one thing, but to reduce the behavior of men’s souls to mechanical laws
is to undermine the very foundation of personal responsibility and morality.
Quételet (1796-1874), the Belgian statistician, was among the first to apply
these quantitative procedures to human behavior in citing the constancy
in the number of crimes from year to year. This regularity, he asserted,
could be used as a basis of probabilistic prediction, in the common-
place manner that automobile deaths are forecast for holiday weekends





In the modern era, statistics began to be linked with mathematics
and to take on its early characteristics in the seventeenth century with
Blaise Pascal and Pierre de Fermat; but it assumed its more mature
physiognomy in the eighteenth and nineteenth centuries. Abraham de
Moivre discovered the normal curve around 1730, about the time of
Newton’s death (1727). Although descriptive statistics, in a kind of
primitive way, dates back to the census of the Biblical epochs (a fact
which hardly detracts from its contemporary importance), probability
statistics had to await the emergence of other social needs and interests.

The laws of probability, which together with descriptive statistics make
up the basic dichotomy of current statistical method, had their beginnings
in two quite different social circumstances: (1) the study of the problems
of gambling, which was a pastime of the French nobility; and (2) the
actuarial interests of the British commercial bourgeoisie. With some
oversimplification in statement, the former gave rise to what is now known
as classical a priori probabilities; the latter to empirical probabilities.
With the apparent intention of eliminating some of the risk from gambling,
the eighteenth century gamblers consulted mathematicians like Pascal,
de Moivre, and Daniel Bernoulli to illuminate the outcomes of games of
chance. On the other hand, among the merchants of Puritan England,
the motivation for statistical enlightenment was perhaps on a more
respectable plane. Interested, as they were, in the growth of capitalism
and in demography, they cultivated problems of an actuarial nature,
which became the basis of their statistical calculations. As early as the
seventeenth century, the Englishman John Graunt (1620-74) was the
first to measure the regularities in the duration of human life, and to con-
sider the relation of birth and death rates to occupations and business con-
ditions. He found fairly constant demographic rates which were among
the first secular evidences of an orderly plan in nature for population
behavior. In 1693 Edmund Halley published the first mortality tables.
By the beginning of the nineteenth century and beyond, Pierre de Laplace
in France, Adolphe Quételet in Belgium, Karl Friedrich Gauss in Ger-
many, and Sir Francis Galton and Karl Pearson in England, were adding
important increments to the science of probabilities from both the mathe-
matical and empirical sides.

Thus, the practice of statistics arose out of the convergence of two
sources: (1) descriptive data of a political, economic, and demographic
character, and (2) the mathematical researches in probabilities. These
were supplemented toward the end of the eighteenth century by astronomers,
who developed the theory of errors out of their astronomical measurements.
Quetelet was the first to fuse these interests for social purposes.
In the latter part of the nineteenth century, biology and genetics began
to contribute significantly through the work of Francis Galton and Karl
Pearson and R. A. Fisher, who are now known for their development of



not certainty; that they apply only to the mass, not to the individual.
The answer is that these assertions are unreservedly true. But they should
be viewed as characteristics of the method, rather than limitations. An
invidious distinction has thereby been drawn between statistical forms of
knowledge, and other forms of knowledge, as though these other forms of
human knowledge did not also fall short of omniscience.

For a person who has profited by statistics, it is difficult to share such
skepticism. The statistical method, rather than replacing “insight,”
reinforces insight. No person can become less human by employing
statistics, nor more human by avoiding them.

As a matter of fact, there is a continuity between common sense, which
normally makes rough quantitative judgments, and statistics, which is
not only a more formal and precise version of such knowledge, but is
scope. Although many a person may "lie with 
tivelv eschew statistics probably lie just as effec- 

“ >>>« ot wri,." The mbiectiv. end the 

each other learamg to coexist and to supplement 

tirties were^^e Udm a Prevalent early consumers of mass sta- 

olfcir dnlin '»■>«"> ley in Ite wolth 

faeture the fiiianeld ^'’^“ “ ‘^'"Population.itsagricultureandinanu- 

However, the rsttspn* °Sy tbe term in the concept of “state." 
Elimbcthan England quantitative connotations. In 

faan, whose repute at!nt«.t..i ^ political functionary or states- 

<^n”: Hamlet avers that ^ ”'^”®Ponded to the modem term "politi- 

I««eedidhold,t.„ou,,utbtsdo. 

^ Abaseoesstowritefair, . . 

public, ndimniftratkn* ^ the study of state, or 

‘'f the State. This concepS^M^'^ philosophy 

»nd m the early nineteenth to England as “statistics,” 

.r,.h„e,ir-. - the fJniW Stales. Thu, "sUle 

”” “*■>• evolved into ‘'statistjp^^- ** *dthmetic” in the seventeenth 
ner, e<tmed to rhjTne with hAii:.*- P”^®*^tly normal linguistic man- 
«ng a ej-stemaiic analj-sis of areas nf***" *°?^**^' ®nd other concepts denot- 
fovernmenta are stin voracious knowledge. Although 

of rtatinics. the concept h^ "tic producers 

of activity m the physical ®*^“ded to innumerable other 

^e earmarks of its PoUtS^S. ^ 




abstract formula. These will never quite conform to each other, and
sometimes, in fact, are only a rough fit, and this for various specific
reasons. (1) Often the data are only crudely quantifiable, as, for example,
attitudes and opinions. (2) Every summarization of data can be only a
compilation of one, or a few, aspects of the observed objects; these aspects
are, so to speak, torn out of context. But the factors that are excluded
from the formula do not thereby cease operating! (3) Formulas vary in
their sensitivity to the complexity of the data and pick up only limited
aspects of their manifold characteristics.

Thus, the data must be prepared and groomed for the formula, which
must be fed the data in a form that it can digest. For example, a formula
cannot absorb subjective attitudes, but will accept the data only when in
the form of measured behavioral equivalents. A formula is very like a
machine, which should not be asked to do a piece of work for which it is
not adapted; otherwise, the formula will chew up the data which are
dumped into it and produce a chaotic mass of uninterpretable digits.
Hence, the application of statistics requires continued discretion grounded
in the knowledge of the subject matter to be interpreted.

Such emphasis on quantitative reasoning will excite in the student a
curiosity about the solutions to the realistic problems and thereby enlarge
his sociological imagination. For it can hardly be expected that every
student in the social sciences would develop a “mathematician’s delight” in
the contemplation of the elegance of an abstract, mathematical procedure,
any more than a mathematician will work up excitement in unraveling
divorce and delinquency rates.

Civilized man is literally engulfed with statistics: public opinion polls,
cost of living indexes, population data and vital statistics, actuarial
calculations in life insurance, games of chance, blindfold tests, and many
other evidences of the prevailing quantitative view of life. The common man,
in his unsophisticated way, employs crude statistical concepts when he
speaks of averages, hunches and hypotheses, probabilities, chance, long
run, samples, and of “all other things being equal.” Statistics, in the
sense that it involves skill in thinking, is therefore not a redundant
academic accomplishment, but is an integral element in the current
thoughtways of our civilization. The development of the contents of this
volume and the selection of the problems and questions for thought have been
designed to foster those skills in statistical reasoning which are essential
not only to the professional social scientist but to the intelligent citizen
as well.

Questions and Problems 

1. Consult the dictionary and other reference books for definitions of statistics.

2. Distinguish between numerology and quantification; quantification and sta- 
tistics. 



theories of association and correlation and other techniques of
measurement.

In retrospect, it is profitable to contemplate the diverse seminal sources
of statistical methods. They received their impetus from a wide variety
of social practices and intellectual disciplines: demography, commerce,
games of chance, astronomy, biology, agriculture, and mathematics. They
have now extended their applications to linguistics, anthropology,
communications theory, aesthetics, and other disciplines. They either touch
lightly, or are heavily involved in, almost all of the conventional scholarly
disciplines.

Principle of Statistical Reasoning. Statistics, when competently
cultivated by the social scientist, comprises much more than the
manipulation of figures and formulas. Statistical procedures, when applied,
consist in relating or fitting social data to the appropriate statistical
formulas and equations. But even this can degenerate into (or not rise above)
the handbook approach of merely plugging the data into some formula and
routinely solving the equation. Effective application consists in the careful
substantive analysis of social data for their adaptability to the statistical
techniques, as well as the careful assessment of the technique for its
potentialities in processing the data, to the end that the model and the data
may be mutually compatible and neatly joined. The two frames of reference
are coordinate and inseparable; hence, in order to implement the application,
familiarity with both the statistical principles and the subject matter
are equally essential. In this text, considerable emphasis has been laid
on the fusion of elementary statistical procedures with the social data;
for this collaboration between the two disciplines constitutes the very
basis of what we shall call statistical reasoning.

Such reasoning is made necessary by the fact that a statistical formula
or model never quite fits the social data; nor do the empirical social data
necessarily fit the ideal procrustean structure of the statistical models.
We should not be duped by the external precision of a pat formula or by
the “exactitude” of numbers. A dozen eggs do not fit the uniformity of
the arithmetic digits which represent them; the distribution of a finite
number of heights or weights does not quite attain the smoothness of the
abstract normal curve which assumes an infinite number; the rank-ordering
of choices does not result in equidistant ranks, as the numerical ratings
may imply. Whatever the readjustments or rearrangements we may
undertake for greater precision, the abstraction still remains an abstraction
that is not a duplicate of the observed empirical objects. It therefore lies
within the very nature of the statistical method and the nature of reality
that the two do not exactly coincide.

Thus, the solution of any statistical problem involves the dovetailing
of more or less imperfectly observed fragments of data with an ideal



Social Variables 
and Their Measurement



Section One 

Fundamentals of Measurement


Social Need of Measurement. Any event or object in nature may be
considered as being made up of a number of component factors: the chemical
contents of the human body, the ingredients of a cake, the elements in the
production of a disease, the social factors that produce a war, or the
dimensions of a social institution. It is the aim of the physical and
behavioral sciences to analyze an event in order to make it understandable.
However, it is not enough merely to identify the factors which enter into
an event; it is also necessary to measure their force, intensity, or quantity.
A chemical formula, a blood count, the amount of a medicinal dose, the
intensity of a social attitude, the size of a population, all testify to the
principle that measurement is essential to knowledge. Without measurement
there would be no statistics.

But measurement implies that we have units of measure. How to devise
effective units of measure, and how to make nature stand still long enough
to apply these devices, are basic problems in both the physical and social
sciences. The typical laboratory method, by which nature can be placed
under a microscope or poured into a beaker and heated to the boiling point,
to say nothing of the utilization of enormously more complicated apparatus,
is an invention of the physical scientist for the express purpose of
immobilizing nature long enough to be manipulated, observed, and
measured.

It is frequently asserted, and perhaps with some support, that social 
data are collected and quantified with considerably more difficulty than 
are physical data. There are, indeed, many obstacles interposed between 
the social scientist and his ultimate measures: the data often cover a large 



3. Elaborate the statements: “Statistics is an extension of common sense”;
“Every man is an informal statistician.”

4. Compare “the statistical view of the universe” with other conceptions of the
structure of the universe.

5. List and discuss social factors in the growth of statistics in Western
civilization.

6. Formulate the argument that statistical methods cannot provide valid
knowledge about man and society.






and female; a person’s marital status may be single, married, widowed,
or divorced; nationality may be American, French, or Italian. The
attributes of a given qualitative variable cannot be scaled, or arranged in
order of magnitude. A given sex, nationality, or marital status cannot be
considered as being “higher,” “greater,” or “larger” than another in
their respective series.*

Because of this limitation, it might appear that the statistical method
would have little to contribute to the analysis of qualitative data. But
quite to the contrary; there is a large and growing body of statistical
techniques now available for the treatment and manipulation of qualitative
variables. These are of particular importance in sociology, simply
because many significant sociological data are qualitative in form.

The corresponding terms in the above classification may be succinctly 
set forth as follows: 


    Variables        Values

    Quantitative     Variates

    Qualitative      Attributes
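
The terminology can be illustrated with a short Python sketch. The observations and the names `ages` and `marital_status` are invented for illustration and are not taken from the text. The sketch shows only the contrast drawn above: variates can be arranged in order of magnitude, whereas attributes can merely be counted by category.

```python
from collections import Counter

# Hypothetical observations on five persons (illustrative values only).
ages = [34, 19, 52, 27, 41]  # quantitative variable: each value is a variate
marital_status = ["married", "single", "married", "widowed", "single"]  # attributes

# Variates have an order of magnitude, so they can be placed on a scale.
ordered_ages = sorted(ages)

# Attributes have no order of magnitude; they can only be enumerated by kind.
status_counts = Counter(marital_status)

print(ordered_ages)
print(dict(status_counts))
```

Sorting the marital statuses would produce only an alphabetical order of labels, which says nothing about magnitude.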

Nevertheless, in its application, even a simple classification of variables
as quantitative and qualitative will turn up apparent ambiguities, because
some traits may seem to fit both types. Thus, occupation, religion, and
crime are undoubtedly qualitative variables which are not intrinsically
measurable in units as are age, weight, or size of population. And yet we
could consider religious denominations as more or less orthodox; crimes
may be ranked in order of seriousness and by severity of punishment;
and occupations may be scaled according to their social prestige. Such
measures, however, are not intrinsic to the attributes; they merely reflect
the separable social values that often adhere to such categories. Different
states classify crimes differently; communists and capitalists would rank
occupations in different order.

In reverse fashion, we often attach what appears to be a qualitative
concept to clearly quantitative data. Various age groups may be quali-

* Some authors apply the concept “variation” only to variables that can vary in
quantity. Qualitative data are therefore sometimes labelled as “non-variable” or
“enumerative,” emphasizing thereby that qualitative categories cannot be measured
as magnitudes, but only enumerated as qualitative traits. Magnitude and quality
would therefore be antithetical concepts. This text, however, shares the position
of those who define variation as covering both quantitative and qualitative
characteristics.

This disputation in terminology can hardly be dignified as a semantic problem, since
there is never a question of essential meaning if terms are consistently employed. It is
simply a problem in labeling. The nomenclature employed by statisticians is by no
means uniform, not only as applied to concepts here under discussion, but to other
statistical concepts as well. This circumstance leads to confusion in the minds of
students who obtain their statistical education from a variety of academic sources.
In subsequent instances, this lack of uniformity will not always call for comment,
but the student should constantly be alert to that possibility.



social area; their collection may violate the sense of privacy of the subjects;
they may be subjective in nature and hence difficult to objectify.
Furthermore, society is heterogeneous and dynamic, and many of its dimensions
do not lend themselves too well to standardized units or categories. Thus,
social distance is more difficult to measure than geographical distance;
social forces are more elusive than are physical forces.

Nevertheless, we must start with the basic assumption that anything
that exists must exist in amounts of more or less, or with greater or lesser
frequency. Accordingly, there are innumerable legitimate questions which
social scientists are called upon to illuminate. How, for example, can one
count the number of Negroes in the United States when their numbers
are changing every day, and the very concept of race is subject to a variety
of definitions? How may we secure measurable data on the simple assertions
that Americans are marrying younger now than several decades
previously; that there is less unemployment now than in the past; that
crime increases in postwar years; that race prejudice is decreasing; that
men are better drivers than women, or that the taste for modern music is
rising? How are we to count and measure the factors in juvenile delinquency
or in divorce?

These questions have these features in common: (1) they call for objective,
quantitative answers, and (2) the appropriate units of measure are
not immediately apparent and are almost always difficult to devise.
Broadly speaking, the manner of quantification is governed by the nature
of the variables under consideration.


Quantitative and Qualitative Variables. Any object or event which can vary
in successive observations either in quantity or quality may be termed a
variable. Variables are accordingly classified as quantitative or qualitative.

A quantitative variable is one which may take on various magnitudes,
i.e., may exist in greater or smaller amounts. Examples of quantitative
variables are: age, height, income, size of population, size of family, length
of prison term, birth rate, and numerous other characteristics. All persons
possess the trait of age, but some have a higher and some a lower age;
cities are of varying sizes; convicts serve prison terms of varying lengths.
These variables can be measured, and each resulting magnitude is called
a variate. A set of variates, such as the heights of the individual soldiers
in a platoon, or the respective sizes of 1,000 families, can be arranged in
order of magnitude from the smallest to the largest. We can locate them
on a scale. This inevitable order of the data can be accepted as a
convenient earmark by which a quantitative variable can be distinguished
from the qualitative type.


A qualitative variable may also vary in successive observations, not in
magnitude, but rather in quality or kind; such qualities are customarily
called attributes. Thus, sex will vary according to the attributes of male




tatively designated, in order, as “infants,” “children,” “adolescents,”
and “adults”; letter grades may be substituted for raw test-scores; the
United States Census calls places with fewer than 2,500 inhabitants
“rural” and all larger places “urban.” But no one should be misled by
these labels into believing that the data have thereby been transmuted
from quantitative into qualitative; we have merely attached new verbal
symbols to arbitrary segments of the data. Anyone familiar with the
symbolism will still interpret the data quantitatively. These labels, which
merely stand for intervals of varying width, are pragmatically more
convenient than the precise raw measures. The mere reduction in precision
does not change the nature of the data.
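
A minimal Python sketch of this labeling, using the census rule quoted above. The function name `place_label` and the sample populations are assumptions made for illustration. Each label is simply a name for an interval of the underlying count, so the ordering of the data is preserved while precision is reduced.

```python
# Rule described in the text: places with fewer than 2,500 inhabitants are
# labelled "rural" and all larger places "urban".
RURAL_LIMIT = 2500


def place_label(inhabitants: int) -> str:
    """Attach a verbal label to an interval of a quantitative variable."""
    return "rural" if inhabitants < RURAL_LIMIT else "urban"


# Hypothetical populations (illustrative values only).
populations = [480, 2499, 2500, 75000]
labels = [place_label(p) for p in populations]

print(list(zip(populations, labels)))
```

A reader who knows the rule can still recover a quantitative statement from each label, for example that every "rural" place has fewer inhabitants than every "urban" one.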

Discrete and Continuous Variables. Statistical procedures differ not only
according to whether the data are qualitative or quantitative, but also
according to whether they are discrete or continuous. Discrete variables
are based upon events which are considered indivisible, which do not vary
in amount, and which are therefore merely present or absent. We cannot
fractionate attributes: a person does not commit half a crime, or secure
a fraction of a divorce, or hold one-third of an occupation. An event is
either a crime or not a crime, either a divorce or not a divorce. Since there
are no gradations, attributes have no natural sequence or continuity;
there is no scale distance between them; it is only possible to “jump”
from one value to another. It is obvious that all qualitative variables are
by nature discrete.

Quantitative data, on the other hand, may be either discrete
(discontinuous) or continuous, which explains why the distinction is usually
reserved for them. In this restricted sense, a discrete variable always
involves counting the number of events, for example: the number of persons
in each family, the number of whole days each worker is absent, the
number of inhabitants in each city, and so on. As these examples suggest,
a discrete variable consists only of whole numbers, and fractional values
cannot occur.

Continuous variables, on the other hand, are theoretically infinitely
divisible into smaller and smaller fractional units. Age, distance, weight,
intelligence quotients, and various kinds of rates can take any one of the
innumerable small values on the scale, or continuum. In order for a person
to pass from eight to nine years of age, he necessarily passes through every
minute gradation between the respective years and at some time has
occupied every one of the values on that continuum, even though he cannot
pause to measure them.
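
The contrast between the two kinds of quantitative variable can be sketched in Python. The family sizes and the "true" age below are invented values, assumed only for illustration. A discrete variable yields whole-number counts, while a continuous variable can be recorded at successively finer precision, each record being only an approximation.

```python
# Discrete variable: a count of whole events, so only whole numbers occur.
family_sizes = [2, 3, 5, 4]  # hypothetical families
all_whole = all(isinstance(n, int) for n in family_sizes)

# Continuous variable: the same hypothetical age recorded at successively
# finer precision (0, 1, 2, and 3 decimal places).
true_age_years = 8.637219  # assumed value, for illustration only
recorded = [round(true_age_years, digits) for digits in range(4)]

print(all_whole)
print(recorded)
```

However many decimal places are kept, a still finer record of the same age remains conceivable.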

Hence, a measure of a continuous variable can never be considered
exact. However accurate the measure, we can always conceive of the
hypothetical possibility of a still more precise measure. The one-thousandth
of an inch on the micrometer, the split second in a photo-finish of


described. Hence, we must necessarily rely on measures of things which
are accessible and observable, and which may be accepted as operationally
equivalent to the subjective variable which is inaccessible to direct
measurement.

To the question, “Is racial antipathy toward the Negro declining?”, we
would in effect reply: “We cannot penetrate the psyche of the population
directly, we are not ‘mind readers’; but we can observe and measure the
annual lynchings, the rate of desegregation, the occupational changes in
the Negro population, the content of editorial expression in journals and
newspapers, and other behavioristic manifestations which must be presumed
to be the equivalent of the subjective attitudes of the population
in question.” Similarly, elusive phenomena, such as happiness in marriage,
have been measured by such behavioral analogues as duration of marriage,
the collaborative activities of the spouses, the number of tensions incurred.
Or, again, changes in musical taste are measured by the content analysis
of the concert repertoire to which the audience attends, or the volume of
specified records which it purchases.

This indirect procedure in measurement is by no means restricted to
social science. In physics, heat is measured indirectly by the expansion
of a column of mercury; electricity is also measured not in terms of itself
but by the work it does. We can measure only what can be perceived by
the senses.

Research scholars in the social sciences have exercised considerable
ingenuity in devising equivalents for the measurement of the intangibles 
of social life. But such circuitous measures are encumbered with certain 
problems from which direct measures are quite free. It is important to 
realize that equivalents are not identities. Among the many alternatives 
which could be proposed, no single one, nor several in combination, will 
be a perfect replica of the thing they presumably represent. In the study 
of racial antipathy, the number of lynchings, interracial club memberships, 
intermarriage, and other correlates are not equally sensitive in registering 
the degree of positive or negative attitudes. 

In spite of these and other difficulties, there is nothing essentially new
or esoteric in the employment of behavioral equivalents. Common sense
has always made inferences about individual motives, attitudes, and values
on the basis of observable behavior. In fact, the aphorism that “actions
speak louder than words” is a recognition of the view that actions may be
more reliable as attitude indicators than mere “words,” which among
honest people presumably reflect their basic attitudes and beliefs.

(4) Lastly, measurement may consist in ranking a series of objects
according to a selected criterion. Such ordinal measures may be con-
structed on the basis of the objective measures already described; or they 
may be purely subjective in character, with little or no overt indication 
of the basis of judgment. Thus, in a relatively objective manner, cities 


Although these issues may seem somewhat trickily pedantic to some 
readers, the concepts of continuity and discontinuity are basic to numerous 
statistical interpretations. 

Devices for the Measurement of Observed Data. From the fact that variables
are not uniformly constituted we have already been able to discern
that no single method of measurement will work on all types of data.
We here formulate four measuring devices, presented in order of objectivity
and amenability to replication: (1) direct enumeration; (2) application
of abstract standard units of measurement; (3) behavioral equivalents
of the phenomena under observation; and (4) ordinal ranking.

(1) A direct enumeration is a simple count of items in terms of themselves.
The categorical item itself is the unit of measure. Thus, the
number of divorces, crimes, persons, or the number of rights and wrongs
in a questionnaire are all direct enumerations of the defined units. All
of the counted items are considered identical, while their differences and
variations are ignored. In fact, in order to be additive, the items must be
viewed as identical. This is the simplest conceivable type of quantification,
familiar to every boy who counts his marbles.

(2) As already intimated, a direct count sometimes lacks desirable precision
because it ignores the variation between items. Bananas, which
vary in size, may be sold by the dozen (a direct enumeration), or by the
pound. By converting bananas to pounds, we employ, on a slightly higher
level of abstraction, a standard unit of measure against which the objects
are matched and compared. The pound, the inch, the year, are conventional
norms of measurement understood by all members of our society.
They have replaced the more primitive units of comparison, such as the
hand, the foot, the thumb, the cubit (forearm), the pace, and horsepower,
some of which still linger in folklore. But none satisfy the requirements
of precision in modern commercial transactions and technology.

Both of these types answer to the requirement of objectivity and are 
readily subject to replication by other observers. They produce what 
some have named ratio scales, since relative intervals that are equal may 
be expressed by the same ratio. 
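
One common reading of the ratio-scale property can be sketched in Python. This reading is an assumption, since the text does not elaborate on the term: the ratio between two measures does not depend on which standard unit is used. The weights and counts below are invented for illustration.

```python
OUNCES_PER_POUND = 16

# Hypothetical weights of two bunches of bananas, in pounds.
bunch_a_lb, bunch_b_lb = 6, 3

# The ratio of the two weights is the same in pounds and in ounces.
ratio_in_pounds = bunch_a_lb / bunch_b_lb
ratio_in_ounces = (bunch_a_lb * OUNCES_PER_POUND) / (bunch_b_lb * OUNCES_PER_POUND)

# A direct enumeration behaves the same way: 24 bananas are twice 12 bananas.
ratio_of_counts = 24 / 12

print(ratio_in_pounds, ratio_in_ounces, ratio_of_counts)
```

Statements such as "twice as heavy" or "twice as many" are therefore meaningful for both of the first two measuring devices.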

(3) The third measuring device utilizes what we shall call the behavioral
equivalent, which yields measures that are less direct and less valid than
those of the forenamed procedures. In the social sciences, much of the
subject matter consists of patterns of attitudes and values which are
significant for the understanding of human behavior. However, these
intangibles are not accessible to material frequency counts or palpable
measuring sticks. Public opinion, social attitudes, social status, marital
happiness, race antipathy, group morale: all constitute legitimate subject
matter for sociological investigation; but their subjective, cultural character
does not permit the application of the direct measuring devices hitherto





Attribute 
Discrete Variable 
Continuous Variable 
Direct Enumeration 
Standard Unit of Measure 
Behavioral Equivalent 
Ordinal Ranking

2. Family size is a discrete variable, yet the average U.S. family has 3.7 persons.
Reconcile these two statements.

3. Family income is measured in discrete dollars and cents, yet we have defined
income as a continuous variable. Explain this seeming inconsistency.

4. Indicate whether the following variables are discrete or continuous: 

Attitude toward war 
Size of city 
Suicide rate 
Slot-machine payoff
Age 

Number of births each year 

5. Give several examples of discrete and continuous variables from the field of 
social science not mentioned in the text. 

6. List several social variables that cannot be measured directly, but which could 
be measured by a behavioral equivalent. 

7. Individual A endorsed 50 out of 100 items reflecting racial prejudice, while
B endorsed 25. Is it reasonable to conclude that A is twice as prejudiced as B?
Explain your answer.

8. A group of students were asked to rank three professors according to teaching 
ability. What problems are involved in this measurement technique? 


Section Two 

Errors in Measurement 

Numbers and Things. The classification of social variables and the available
devices for their measurement have been portrayed, not as inherent
categories of nature, but as cultural inventions for the purpose of observing
and ordering the world around us. Since measurement is a human activity,
all measures of such variables are necessarily infected with human
fallibility. If the methods of measurement in current use constitute
improvements over those of a century ago, the various alternatives now
available will in turn be replaced by modifications yet to be devised.

To be sure, mathematical quantities, in which all measures are couched,
have the external appearance of being exact and absolute, a fact which


may simply be arrayed in order of size after a careful count of their
inhabitants, or married couples may be ranked for happiness after elaborate
marriage-adjustment scores have been computed. In such instances,
mere ranking involves throwing away carefully calculated accuracy, which
may not be needed at the moment. On the other hand, in a predominantly
subjective manner, pupils may be ranked by their fellows in order of
personal preference, or pictures may be ordered by judges who doubtless
apply diverse criteria, even though these criteria remain more or less
unconscious, or at least unexpressed. Analogously, occupations, neighborhoods,
poets and musicians, may be ranged in order of prestige. In general,
any series of attributes may be ranked according to some external criterion;
and variates arranged according to their respective magnitudes.

It must be obvious that the statistical difficulties in ranking lie in its
relative coarseness. Owing to its judgmental nature, (1) intervals between
successive ranks are not necessarily identical, and (2) two or more independent
rankings of the same objects will not necessarily be comparable.
However, when intuitive norms are the only ones available, or when precision
is of no great importance, rank-order measurements will serve their
purposes.

Ordinal ranks, which are merely measures of relative position, are,
nevertheless, frequently given specific numerical but artificial weights.
Thus, degrees of attitude intensity are frequently weighted from one to 
five, in order that the data may be made amenable to further analysis. 
Such admitted approximations are very useful and are prevalently em- 
ployed in social research. 

Each of the four measuring devices just described possesses its individual 
characteristics; each follows its own rules and principles of interpretation.
Furthermore, a given measure may actually exemplify two or more of the 
described types. For example, a direct enumeration of crimes or divorces, 
serving the purposes of a court docket, may at the same time serve opera- 
tionally as an indirect measure of social disorganization or family insta- 
bility, respectively. They all have in common, however, the important 
characteristic that they endeavor to express measurements in terms of 
symbols which permit the development and communication of social 
knowledge, and lay down the bases of measurement without which statistics 
would have no existence. 

Questions and Problems 
1. Define the following concepts:

Variable 

Value 

Quantitative Variable 

Variate 

Qualitative Variable



become as elastic as would a rubber yardstick among textile merchants,
each of whom would expediently adjust the unit of measure with his own
amount of stretch. The acknowledged unreliability of national crime
statistics is a notorious illustration of the consequences of the employment
of a loosely defined concept which no amount of meticulous statistical
refinement can ever overcome.

Nevertheless, concepts must still be defined as precisely as possible for 
purposes of sociological research; otherwise the results will be all the more 
ambiguous and dubious in value. Thus, in a survey on income, it is necessary
to specify what constitutes income (wages, gifts, dividends, legacies,
prizes) in order to obtain results which are uniform and comparable.
Similarly, in a study of sentenced embezzlers, it is necessary to define the
nature of this offense in order to exclude thieves who are embezzlers in
name only. Social data should always be collected under the clearest
possible working definitions, notwithstanding that such definitions will seldom
be easy to formulate, and will usually contain some arbitrary elements.

(2) Imperfections in the Measuring Instrument. After the variable has
been defined with all possible care and precision, the observations may
still be inaccurate because of flaws in the measuring instrument. A faulty
speedometer may overstate the mileage; a defective timepiece may run late.

Analogously, social measurements may be thrown into error by poorly 
conceived and inexpertly constructed measuring devices, such as the 
questionnaire and interview, which are so heavily relied on in sociological 
inquiry. An unduly long questionnaire may produce fatigue and indifference
on the part of the subject, and thereby mar the accuracy of his
responses. Questions may be presented in such order that successive replies
are progressively distorted. Stereotyped words and phrases may evoke biased
reactions which could be avoided by the use of more neutral language.

In the face-to-face interview, imprecise and leading questions may lead
to confusion and frustration and fail to elicit that sense of participation 
which is so essential to accurate information. In addition, the personal 
contact may often produce inhibitions that would not disturb the response 
to an anonymous ballot. 

Recognizing such hazards, behavioral scientists have sought to improve 
their rating devices and score cards, census schedules, multiple-choice 
and open-end questions, focused and depth interviews, attitude and personality
scales, and similar instruments in their quest for dependable data.
All of these efforts testify to the principle that sound measurement is a
prerequisite to statistical manipulation, which otherwise would be hollow
and pretentious. 

(3) The Human Observer. The human mind is by no means a flawless
recording instrument of natural events. Even under presumably identical
conditions, two equally competent individuals will obtain divergent results.
Thus, it is well known that in clocking the time of a foot race, in measuring 



has conferred upon mathematics the venerable prestige of being the most
pure and exact of all disciplines. However, when abstract numbers are 
applied to concrete things, as ultimately they must, they necessarily 
sacrifice their purity and detachment. In the symbolism of arithmetic, 
for example, 4 + 4 = 8 for every one familiar with the conventions of the 
number system. However, when these numerals are made to symbolize 
eggs in a commercial transaction, four eggs added to four other eggs do 
not, in the same manner, equal eight eggs in the opinion of the economy- 
minded customer. The merchant struggles to approximate the exactness 
of pure numbers by grading the eggs according to various criteria — age, 
size, color — but he never succeeds completely in that endeavor. In the 
material world, the eight eggs are never as uniform as are the abstract 
units. Mere numbers will never duplicate the realities of the concrete 
world in which we must live. 

If this fact is not borne in mind, we will make the mistake of ascribing
the ideal properties of disembodied numbers to the raw objects which
they represent. Numbers are only a system of symbols which never completely
reflect the objects they symbolize. By clothing our observations
in numbers, we do not necessarily confer on the data of social science the
simplicity and exactness of numbers. Observations can never be expressed
without error.

However, recognition of this principle does not mean that errors should
be passively accepted. On the contrary, every effort should be made to
identify and reduce them, and this is possible only if the sources of these
errors are known.

Sources of Measurement Error. We here briefly consider four broad, overlapping
and interlocking sources of error that have various statistical
implications: (1) vagueness in definition of the variable, (2) imperfections
in the measuring instrument, (3) limitations of the human observer, and
(4) the inconstancy and elusiveness of social behavior. All of these sources
are likely to influence jointly any given measure; their effects cannot be
readily disentangled.

(1) Vagueness in Definition. Measurements will be inaccurate when the
variable to be measured is only vaguely defined. Divorce, nationality,
war, race, family, occupation, minority, assimilation, crime, middle class,
unemployment, and most other sociological concepts are difficult to define
in an unambiguous manner, and all the more so over an extended duration
of time, or for multiple cultural areas. In general, such concepts are
not physical entities which may be perceived objectively by the five senses
as are apples or chemicals; they are cultural definitions with somewhat
varied content which reside as social values among the individual members
of a society. Scrupulous uniformity of interpretation on the part of different
observers is therefore simply impossible. Such concepts sometimes

But it is as difficult to separate chance and constant errors in practice 
as it is easy to distinguish them in principle. Thus, it would be impossible 
to determine, merely by inspection, whether the socio-economic ratings 
made by an upper-class interviewer had a constant error, although any
sociologist would be alert to that possibility. Hence, in any responsible 
social inquiry, it is necessary to be alert to the possible presence of constant 
errors, although it is their very nature to elude us. 

(4) Inconstancy and Elusiveness of Social Behavior. When prisoners are
herded by their guards into an auditorium to fill out a questionnaire, they
will react otherwise than they would if each one were permitted to act
spontaneously in a more conventional habitat. Persons undergoing repeated
interviews by social workers, for example, are likely to become
"interview-wise" and thus distort their replies by prevarication or downright
untruths, either to deceive or put their best foot forward. Democrats
have been known to misrepresent themselves as Republicans, according
to the reports of opinion polls, when reference to such membership seemed
more "respectable." Incomes are exaggerated because of false pride or
understated out of self-interest. It is commonly known, and indeed
expected, that the residents of the Elmtowns, Middletowns, and Levittowns
often modify their behavior when they are conscious of being under
observation. Group interaction in a laboratory setting is recognized for
its artificiality; hence, to assure the normality of responses, investigators
employ one-way mirrors and other comparable devices. Such sensitivity
of human subjects under observation differs markedly from the relative
constancy of inanimate chemicals in the laboratories of the natural sciences.

Furthermore, many forms of human behavior are indeed beyond reach
in a physical sense. It is impossible to enumerate all crimes, since many
remain concealed; it is impossible to observe all criminals, since many
criminals remain unapprehended and out of custody. A similar argument
applies to the establishment of the incidence of mental illness. By the
same token, it is impossible to acquire accurate data on the "passing"
between races since, at least in the racially conscious United States, its
very success depends on secrecy and concealment. Thus, the acquisition
of complete data on many social phenomena is effectively precluded by
reason of the play of social tabus and private interests.

But social science should not therefore succumb to the comforts of an
inferiority complex. Physical science, too, sometimes suffers from similar
handicaps. Failure to predict the weather accurately is in part attributed
to the fact that observations in remote parts of the earth and in the stratosphere
are not available. Although earthquakes have been reduced to
causal mechanisms, observations cannot be made in the depths of the earth
which would allow geologists to predict them.



social interaction under controlled laboratory conditions, or in rating dwelling
units on a socio-economic scale, there will be discrepancies ascribable
to individual variability in alertness, temperament, intelligence, selective
perception, and visual acuity.

When such errors compensate one another in the long run, we call them
chance errors. There is, however, another type of error which is more
difficult to control and which is the result of a tendency to record an event
with a constant error, or bias. Social measures may be particularly susceptible
to that type of error, since social phenomena are rarely perceived
with the same degree of detachment as are physical events. An interviewer
may obtain replies that conform to his own convictions, whether
through misinterpretation of the replies or through suggestion to the
respondent in the process of questioning. Thus, interviewers who themselves
believed that the United States should keep out of World War II
obtained a larger percentage of "Keep Out" opinions than did those interviewers
who favored a "Help England" policy (Table 3.3.3, p. 61).
Even the judges of the United States Supreme Court are swayed by personal
bias and ideology, as revealed by their split decisions, and the reception
that a "liberal" or "conservative" nominee to the court meets in the
Senate Committee.

At times heroic efforts have been made by research agencies, necessarily
with varying degrees of success, to enlist unbiased field observers and
impartial investigators. Thus, the Board of the Carnegie Corporation
methodically searched for a disinterested scholar to direct the comprehensive
study of the Negro in America which they had inaugurated in
the late nineteen-thirties. The Chairman of the Board acknowledged that

there was no lack of competent scholars in the United States who were deeply
interested in the problem and had already devoted themselves to its study,
but the whole question had been for nearly a hundred years so charged with
emotion that it appeared wise to seek as the responsible head of the undertaking
someone who could approach his task with a fresh mind, uninfluenced
by traditional attitudes or by earlier conclusions, and it was therefore decided
to "import" a general director. . . . And since the emotional factor
affects the Negroes no less than the whites, the search was limited to countries
of high intellectual and scholarly standards but with no background or traditions
of imperialism which might lessen the confidence of the Negroes in the
United States as to the complete impartiality of the study and the validity
of its findings. Under these limitations, the obvious places to look were
Switzerland and the Scandinavian countries, and the search ended in the
selection of Dr. Gunnar Myrdal . . . a professor in the University of Stockholm,
economic advisor to the Swedish government, and a member of the
Swedish Senate.*

* From F. P. Keppel's foreword to Gunnar Myrdal, An American Dilemma, Harper &
Brothers, New York, 1944, I, pp. vi-viii.




Family 

Nationality 

Employment 

Public opinion 

Household 

Delinquent 

(b) Why are concepts such as these difficult to define? Compare definitions of
these concepts with those in the physical sciences, e.g., molecule, acid, 
osmosis, alloy, earthquake, tornado. 

4. Itemize some of the difficulties in making a reliable enumeration of:

The mentally ill
Illegitimate births
Interracial passers

5. Illustrate how biases might operate in measuring social distance, social class,
religious tolerance, prejudice, and culture lag.

6. What are the possible errors and mistakes and their sources in the following
hypothetical measures?

(a) Divorces have increased 20 per cent during the last 50 years.

(b) Twenty per cent of the labor force is unemployed.

(c) Middle-class delinquency doubled in the last decade.

7. Mistakes and errors represent deviations from the "true" value. In which
instance is the true value more easily established? 

Section Three 

Rounded Measures

Purpose of Rounding. Because of the human equation, the imperfections
in the measuring tools, and the other factors previously described, most
actual measurements are destined to remain to a greater or lesser degree
imprecise. The official figures on births and deaths, marriages and divorces,
population, and crime can never be more than approximate. In practice,
therefore, we must terminate at some convenient stage in the process of
measuring the attempt at precision. In colloquial speech, we "round off"
when further exactitude is neither possible nor necessary. Furthermore,
rounded figures are more easily manipulated, and therefore represent a
justifiable economy, even at the sacrifice of accuracy, provided
that such accuracy is not essential. Thus, we arrive at two kinds of values:
true values, which are generally unobtainable, and rounded values, which
are employed in actual calculation. But rounding should not be capricious;
it must be systematically carried out according to established rules.

Rounding procedures. If measures are only approximate, they must 
obviously be either too high or too low. When values have been rounded 


Errors and Mistakes. Errors which inhere in the measuring process must
not be confused with mistakes. In lay language, errors and mistakes are
synonymous, but in statistical parlance these terms are clearly differentiated,
even though their concrete denotations may at times be ambiguous.

A better name for mistakes would be "blunders": blunders which
derive from inexperience, incompetence, and other circumstances which
a well-regulated inquiry seeks to reduce or eliminate. Although an occasional
mistake is tolerated, too many of them constitute an unflattering
reflection on the temperament, training, or meticulousness of the worker.
A gauche interviewer who offends his subjects, the apprentice sociologist
who confuses race and culture, the assistant who employs a wrong formula,
produce avoidable mistakes which show up as blemishes on the countenance
of the study.

However, the term "error" carries no such disparagement. Although
errors, too, vary in size and seriousness, and are therefore reducible, they
may be calculatingly accepted when the cost of reducing them exceeds the
probable benefits of increased precision.

The fundamental importance of the distinction between errors and
mistakes is that there are statistical techniques which can cope with error;
but mistakes are beyond repair. In fact, chance errors are subject to the
"law of error" because they are the result of chance factors, and may therefore
be evaluated. Even constant errors, or biases, which are the consequence
of determining factors, although not analyzable according to the
law of error, may nevertheless often be adjusted by appropriate statistical
techniques. Understatement of age, the tendency to round at five-year
intervals, the failure to report infants, and similar biased census responses
may be adjusted by smoothing and other technical artifices.

But mistakes and blunders are likely to be erratic and not subject to
systematic analyses. Statistical methods are therefore helpless when confronted
with them; they should be banished. Errors cannot be banished;
consequently, their analysis and interpretation constitute one of the central
problems in statistical method.
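
The contrast between chance errors, which compensate one another in the long run, and constant errors, which do not, may be illustrated by a small simulation. What follows is only a sketch under stated assumptions: the true value of 50, the spread of 2, and the bias of 3 are hypothetical figures chosen for illustration, and the normal distribution is assumed merely as a convenient model of chance error.

```python
import random

random.seed(0)  # fixed seed so the sketch is repeatable

TRUE_VALUE = 50.0   # hypothetical true value of the variable
SPREAD = 2.0        # hypothetical size of the chance error
BIAS = 3.0          # hypothetical constant error of a biased observer
N = 10_000          # number of repeated observations

# Observations affected by chance error only.
chance_only = [TRUE_VALUE + random.gauss(0, SPREAD) for _ in range(N)]

# Observations affected by chance error plus a constant error (bias).
with_bias = [TRUE_VALUE + BIAS + random.gauss(0, SPREAD) for _ in range(N)]

mean_chance = sum(chance_only) / N
mean_biased = sum(with_bias) / N

# Chance errors tend to cancel in the average; the bias does not.
print(f"mean with chance error only: {mean_chance:.2f}")
print(f"mean with constant error:    {mean_biased:.2f}")
```

The first average should lie close to the true value, while the second should remain displaced by roughly the amount of the bias, however many observations are taken.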


Questions and Problems 

1. Define the following concepts:

Measurement Error
Chance Error
Constant Error
Mistake


2. Distinguish between number as mere symbol and as measurement.

3. (a) In planning a sample survey, how might a statistical worker define the




round to the nearest even number. Thus, 7.5 would be raised to 8, while
6.5 would be lowered to 6. The justification for this practice is that,
theoretically, even and odd numbers occur with equal frequency in the
long run. Consequently, in rounding consistently to the nearest even
number, half of the values will be raised and the other half lowered, thereby
canceling out the errors and leaving the sums free of rounding error (Columns
1 and 2). With uniform raising or lowering, however, the rounding
errors would cumulate rather than compensate one another (Columns 3
and 4).


      (1)               (2)             (3)            (4)
Observed Value     Nearest Even     Next Higher     Next Lower

      6.5                6               7               6
      4.5                4               5               4
      3.5                4               4               3
      7.5                8               8               7
     ----               --              --              --
     22.0               22              24              20
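
The three rounding procedures compared in the table may be checked with a short computation. This is a minimal sketch, not part of the original text: it takes the observed values as 6.5, 4.5, 3.5, and 7.5 (which sum to the tabled total of 22.0), and it uses the rounding modes of Python's standard `decimal` module as stand-ins for "nearest even," "next higher," and "next lower."

```python
from decimal import Decimal, ROUND_HALF_EVEN, ROUND_CEILING, ROUND_FLOOR

# Observed values, each lying midway between two whole numbers.
observed = [Decimal("6.5"), Decimal("4.5"), Decimal("3.5"), Decimal("7.5")]


def round_all(values, mode):
    """Round each value to a whole number under the given rounding mode."""
    return [int(v.quantize(Decimal("1"), rounding=mode)) for v in values]


nearest_even = round_all(observed, ROUND_HALF_EVEN)  # ties go to the even digit
next_higher = round_all(observed, ROUND_CEILING)     # always raised
next_lower = round_all(observed, ROUND_FLOOR)        # always lowered

print("observed sum:    ", sum(observed))
print("nearest even sum:", sum(nearest_even))
print("next higher sum: ", sum(next_higher))
print("next lower sum:  ", sum(next_lower))
```

Only the nearest-even column reproduces the sum of the observed values; uniform raising or lowering displaces the sum by the accumulated half-units.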


Severity of Rounding. According to the United States Census, the 1950 population
of Washington, D.C., was 802,178. This figure not only requires an
effort to remember and manipulate, but additionally, the last several digits
are not even worth such an effort since they are almost certainly unreliable,
owing to difficulties in enumerating large human populations. Hence,
one would not hesitate to round the figure to 802,180 or 802,200, or even
to 802,000, which seems sufficiently precise for all practical purposes.
With the amount of error inherent in census-taking, we could just as well
have come out with a count of 801,937, which would also have rounded to
802,000. In any event, the question of how far to round is almost altogether
a substantive issue whose resolution will normally be guided by two
criteria: (1) our estimate of the reliability of the last digits, which, if
inaccurate, should be suppressed by rounding; and (2) the degree of
imprecision we are willing to tolerate. Underlying both of these criteria
is, of course, the usual desire for economy of effort which rounded values
permit.
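
The successive degrees of rounding applied to the Washington figure may be reproduced as follows. The helper `round_to_unit` is a hypothetical name introduced only for this sketch; it rounds a non-negative whole number to the nearest multiple of the chosen unit and makes no special provision for exact ties, which do not arise in these figures.

```python
def round_to_unit(count, unit):
    """Round a non-negative integer to the nearest multiple of `unit`."""
    return ((count + unit // 2) // unit) * unit


census_count = 802_178  # 1950 population of Washington, D.C., as quoted

for unit in (10, 100, 1000):
    print(f"nearest {unit:>5}: {round_to_unit(census_count, unit):,}")

# A slightly different count rounds to the same thousand.
print(f"{round_to_unit(801_937, 1000):,}")
```

The coarser the unit, the more of the doubtful final digits are suppressed, and the less the rounded figure depends on the accidents of enumeration.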


Significant Digits. The digits that have been retained, on the assumption
that they are reliable, are called significant digits. The digits of doubtful
dependability are dropped. In the case of whole numbers, as in the above
instance, the discarded digits are replaced by zeros merely in order to
preserve the location of the decimal point. Thus, in rounding 802,178 to
the nearest thousand, the vacated places must be held by zeros; otherwise
the unit of count ('000) would be lost, and the meaning of the number
would be lost. However, other than to indicate the unit of count, such
zeros have no function. It is the significant digits that constitute the
number. Thus, it is the significant digits "802" that specify how many
thousands of persons reside in the city of Washington.




to the next lower, or last, unit, they are always too low; when rounded to
the next higher unit, they will always be too high; and when rounded to
the nearest unit, they will be either too high or too low. We may illustrate
these differences in rounding procedures with an example of age. A person
may quote his age as 20. If he follows the popular practice of giving his
age as of last birthday (a practice which the United States Census follows
as well) he will convey the information that his true age is somewhere
between exactly 20 and not quite 21. He has accordingly rounded his age
to the next lower whole number on the assumption that greater accuracy
is not called for. This rounding error, which may be as much as a whole
year, is of course not a mistake, but a known consequence of the system
of rounding employed.

For actuarial purposes slightly greater precision is required. To an
insurance statistician, a quoted age of 20 signifies an age between 19.5 and
20.5 years. This is rounding to the nearest whole year, or nearest birthday.
It is a more exact procedure, since the rounding error in this case cannot
be more than a half-year from the true age. For that reason, rounding
to the nearest unit is the most common statistical procedure, and may be
assumed unless it is otherwise noted.
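
The interval of true ages implied by a quoted age under each rounding procedure can be written out explicitly. The function name `true_age_interval` and the rule labels "last," "nearest," and "next" are hypothetical conveniences of this sketch, not terms from the text; the interval for the next-higher rule is inferred by symmetry with the next-lower rule.

```python
def true_age_interval(quoted_age, rule):
    """Return (lower, upper) bounds on the true age for a quoted age.

    rule: "last"    -> age as of last birthday (rounded to next lower unit)
          "nearest" -> age as of nearest birthday
          "next"    -> age as of next birthday (rounded to next higher unit)
    """
    if rule == "last":
        return (quoted_age, quoted_age + 1)
    if rule == "nearest":
        return (quoted_age - 0.5, quoted_age + 0.5)
    if rule == "next":
        return (quoted_age - 1, quoted_age)
    raise ValueError(f"unknown rounding rule: {rule}")


for rule in ("last", "nearest", "next"):
    low, high = true_age_interval(20, rule)
    # The largest possible rounding error is the distance from the quoted
    # age to the farther end of the interval.
    max_error = max(20 - low, high - 20)
    print(f"{rule:>7}: true age between {low} and {high}, "
          f"maximum error {max_error}")
```

Under the nearest-unit rule the largest possible error is half a year, against a full year under either of the other two rules.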

In oriental countries, age is rounded to the next higher unit, but this
[...] it is adopted in [...]. For example, postage charges are assessed
[...] to the first [...] (1) quick calculation according [...] maximization
of charges. [...] guest does not vacate by checkout time. Workers who are
paid by the [...] must always [...] ages are often rounded to the week;
the ages of adult [...] are often rounded (estimated) in multiples of five
years; small [...] astronomical national budgets may be [...] to the [...]
billion according to our tolerance limits.

[...] observed value except when the [...] two adjacent values.* In this
instance, an amendment [...] rounding to the nearest unit, 7.5 could
logically be [...]. One solution is never to get stuck [...] of increasing
[...]





is the rounded number given, we would add and subtract .05 to obtain the
true limits, 1.25-1.35. Thus, the true limits will always consist of one more
significant digit than the rounded value.

The details of the foregoing operations hold only when rounding is to
the nearest unit. When other rounding procedures are employed, the 
same general principles will obtain, although the computational details 
will vary. 
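
The rule for true limits under rounding to the nearest unit (add and subtract one-half the rounding unit) is easily put into computational form. The function name `true_limits` is hypothetical; exact decimal arithmetic is used here only so that figures such as 1.25 and 1.35 appear without binary floating-point residue.

```python
from decimal import Decimal


def true_limits(rounded, unit):
    """True limits of a value rounded to the nearest `unit`.

    Both arguments are converted through their string form so that
    decimal fractions such as 0.1 are represented exactly.
    """
    value = Decimal(str(rounded))
    half_unit = Decimal(str(unit)) / 2
    return (value - half_unit, value + half_unit)


# The same rounded number, 400, under three different rounding units.
for unit in (1, 10, 100):
    low, high = true_limits(400, unit)
    print(f"400 rounded to nearest {unit:>3}: {low} to {high}")

# A decimal example: 1.3 rounded to the nearest tenth.
print(true_limits("1.3", "0.1"))
```

As the example of 400 shows, the limits cannot be computed at all unless the rounding unit is known.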

Questions and Problems 

1. Define the following concepts: 

True Value 
Rounded Value 
Significant Digit 
True Limits 
Midpoint 
Rounding Error 

2. Round the following to the nearest whole number: 

4.05 
4.S1 
4.50 
5.50

4.601 

3. Round the following (a) to the next higher, and (b) to the next lower whole
number: 


2.25 

0.0001 

4. Assume that 10 and 20 have been rounded to the nearest whole number. 

(a) Calculate; 

minimum true product
maximum true product
minimum true quotient of 10 ÷ 20
maximum true quotient of 10 ÷ 20
minimum true quotient of 20 ÷ 10
maximum true quotient of 20 ÷ 10
minimum true sum
maximum true sum

(b) What is the statistical significance of the variety in these answers? 




Occasionally, even reliable digits are dropped for pure convenience, as
in the informal quotation of a bank balance of $500, when the exact amount
is $511.74. But the zeros used as substitutes would, of course, not be
considered significant. If, on the other hand, there is reason to believe
that the last zeros are exact and therefore reliable, as in the salary of $200
per week, such zeros would be regarded as significant. In such a case, the
two zeros are part of the dollar count; they signify that 200 is part of the
sequence 199, 200, 201, rather than 100, 200, 300.

Confidence and Precision. The population of Washington, D.C., quoted
as 802,178, contains six significant digits only on the assumption that all
digits are reliable and deserve confidence. A student of population, as
already intimated, will question that reliability. The foregoing figure
appears to be precise, but it is actually inaccurate. He will therefore fall
back on the less precise but more dependable number of 802,000, since it
makes no pretense of being accurate in the last three places. Paradoxically,
he will have more confidence in the less precise number. Thus, we may
formulate the rule: other things being equal, as precision decreases, confidence
increases; as precision increases, confidence decreases. A little
reflection and experience will support the simple logic of this generalization,
which is employed in many statistical situations.


True Limits of Rounded Numbers. It is obvious that in rounding to the
nearest whole unit, observed values such as 4.7, 4.9, 5.2, and 5.4 will all
be rounded to 5. Hence, when we encounter that rounded value, we must
be aware that the observed value, inaccurately represented by 5, may lie
anywhere within the interval, 4.5 to 5.5. Such a reconstruction of what
are labeled the true limits from the rounded number is the reverse of the
process of rounding. That is, we infer that the original observed value
must have been at a point somewhere along the interval, extending a half
unit on either side of the rounded value, 5. Therefore, the rounded value
is the midpoint of that interval and lies exactly half-way between the
true limits.

However, these true limits will be impossible to establish when the
severity of the prior rounding is not indicated, as may be seen in the following
illustration:


Rounded Number     Unit of Measure     True Limits

     400               Ones            399.5-400.5

     400               Tens            395-405

     400               Hundreds        350-450

From these examples, we may formulate a working rule for establishing
true limits: add to, and subtract from, the rounded number
one-half of the rounding unit. Thus, if 10 is the rounding unit, we
would add and subtract 5, resulting in the true limits of 395-405. If 1.3



Grouping of Data 



Section One

Quantitative Variables


All statistical knowledge is knowledge of large aggregates. But all collection
of data must begin with the individual case. This is an important
principle that is sometimes lost sight of after the individual item has been
swallowed up by the mass. Nevertheless, we do not linger on the individual
event, for reliable knowledge rests on the repetition of experience. But
the human mentality, unaided, cannot encompass a multitude of separate
observations. If it attempts to do so, it is likely to be unduly impressed
by conspicuous cases or influenced by expectations and desires, so that
conclusions will be distorted and misleading. It is one of the functions of
statistics to reduce such hazards by its techniques of organizing and condensing
masses of data into comprehensible form. So fundamental is
this function that statistics has frequently been characterized as the
"science of large numbers."

There are two basic types of statistical groupings, corresponding to the
previous classification of variables: (1) the quantitative, in which variates
are grouped into ordered class intervals; and (2) the qualitative, in which
attributes are distributed into disparate categories. Thus, we may group
the incomes of 1,000 persons into any convenient number of graduated
intervals, but the classification of the same persons by sex is necessarily
twofold, and the order of categories is arbitrary. In spite of their differences,
both quantitative and qualitative classifications culminate in the
frequency distribution, which is simply a tally of the cases in each of the
respective classes.
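
The two types of grouping may be sketched with a tally of each kind. Everything in the example is hypothetical: the eight incomes, the class width of 2,000, and the helper name `class_interval` are illustrative assumptions, not data or procedures taken from the text.

```python
from collections import Counter

# Hypothetical observations on eight persons.
incomes = [1200, 3400, 2500, 4700, 800, 3900, 2100, 5600]
sexes = ["M", "F", "F", "M", "F", "M", "F", "M"]

CLASS_WIDTH = 2000  # an arbitrary, convenient width for the class intervals


def class_interval(value, width):
    """Return the (lower, upper) limits of the interval containing value."""
    lower = (value // width) * width
    return (lower, lower + width - 1)


# Quantitative grouping: variates tallied into ordered class intervals.
income_distribution = Counter(class_interval(x, CLASS_WIDTH) for x in incomes)

# Qualitative grouping: attributes tallied into categories with no order.
sex_distribution = Counter(sexes)

for interval in sorted(income_distribution):
    print(interval, income_distribution[interval])
print(dict(sex_distribution))
```

In both cases the result is a frequency distribution; only the income intervals have a natural order, whereas the categories of sex may be listed in either order.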

A possible exception to the foregoing division is the time series, which is
an ordering of values according to an entirely different dimension, namely,
chronology of occurrence.



6. If a girl's height to the nearest inch is 65, between what true limits would her
observed height fall?

7. An array of homicide rates extends from .5 to 17.6, rounded to the nearest
tenth. What are the true limits of this interval?

8. Designate the significant digits in each of the following numbers:

2.3

00203

.600

.006

800

8,000

9. Distinguish between .65 and .650

10. (a) Is 51 centimeters necessarily less precise than .51 meter?

(b) Is 5,280 feet more precise than one mile?

(c) Is 50 miles less precise than 50.25 miles?

11. Two baseball players have 60 and 59 hits out of 190 times at bat, respectively.
How would one determine the number of decimal places in the batting average?
Why does it usually consist of three places? [Note: The "batting average"
is defined as the number of hits divided by the number of times at bat.]

12. Explain why rounding to the nearest unit introduces less rounding error than
rounding to the last unit.

13. (a) Formulate the rule for calculating the true limits of a number rounded
to the last unit. (b) What is the maximum error in such rounding; and in
what direction does it lie from the rounded value?


Selected References 


Bureau of the Census, Census of Population: 1950. Volume 2, Part 1,
United States Summary. U.S. Government Printing Office, Washington,
D.C., 1953. Pages 1-66.

Cohen, Morris, and Ernest Nagel, An Introduction to Logic and Scientific
Method. Harcourt, Brace and Company, New York, 1934. Chapter


Dantzig, Tobias, Number: The Language of Science. Fourth edition.
Doubleday Anchor Books, Garden City, N.Y., 1956.

Lazarsfeld, Paul F., "Problems in Methodology," in Sociology Today.
Edited by Robert K. Merton, Leonard S. Cottrell, Jr. Basic Books,
Inc., New York, 1959. Pages 39-78.

Torgerson, Warren S., Theory and Methods of Scaling. John Wiley & Sons,
Inc., New York, 1958. Chapters 1 and 2.

Walker, Helen M., Mathematics Essential for Elementary Statistics. Revised
edition. Henry Holt and Company, New York, 1951. Chapters


30 



§ 3.1 Quantitative Variables


Table 3.1.1a (Continued)

                                    (1)           (2)         (3)
City                             Population    Suicides    Rate per
                                                            100,000

Houston, Texas                     596,163         88        14.8
Indianapolis, Indiana              427,173         68        15.4
Jacksonville, Florida              204,517         22        10.8
Jersey City, New Jersey            299,017         20         6.7
Kansas City, Kansas                129,553         14        10.8
Kansas City, Missouri              456,622         50        10.9
Knoxville, Tenn.                   121,769         13        10.4
Little Rock, Ark.                  102,213         15        14.7
Long Beach, California             250,767         61        24.3
Los Angeles, California          1,970,358        371        18.8
Louisville, Kentucky               369,129         37        10.0
Memphis, Tenn.                     396,000         26         6.6
Miami, Florida                     249,276         31        13.6
Milwaukee, Wisconsin               637,392         74        11.6
Minneapolis, Minn.                 521,718         76        14.6
Mobile, Ala.                       129,009          9         7.0
Montgomery, Ala.                   106,525          8         7.5
Nashville, Tenn.                   174,307         23        13.2
New Bedford, Mass.                 109,189         13        11.9
New Haven, Conn.                   164,443         19        11.6
New Orleans, La.                   579,445         40         7.0
New York City, New York          7,892,957        537        10.6
Newark, New Jersey                 438,776         62        14.1
Niagara Falls, N.Y.                191,555          5         2.6
Norfolk, Virginia                  213,513         24        11.2
Oakland, California                384,575         58        15.1
Oklahoma City, Okla.               243,504         22         9.0
Omaha, Nebraska                    251,117         31        12.3
Pasadena, Calif.                   104,577         21        20.1
Paterson, New Jersey               139,336         19        13.6
Peoria, Illinois                   111,856         15        13.4
Philadelphia, Pa.                2,071,605        178         8.6
Phoenix, Arizona                   106,818         25        23.4
Pittsburgh, Pa.                    676,806         78        11.5
Portland, Oregon                       ...         66         ...
Providence, Rhode Island               ...         24         9.6
Reading, Pa.                           ...         18        16.5
Richmond, Virginia                 230,310         42        18.2
Rochester, New York                332,488         47         ...

(Table continued)












Grouping of Data


Table 3.1.1a Suicides and Suicide Rates, 107 Large U.S. Cities, 1950

City                             Population

Akron, Ohio                        274,605
Albany, N.Y.                       134,995
Allentown, Pa.                     106,756
Atlanta, Ga.                       331,314
Austin, Texas                      132,459
Baltimore, Maryland                946,708
Baton Rouge, La.                   125,629
Berkeley, California               113,805
Birmingham, Alabama                326,037
Boston, Mass.                      801,444
Bridgeport, Conn.                  158,709
Buffalo, New York                  580,132
Cambridge, Mass.                   120,740
Camden, New Jersey                 124,555
Canton, Ohio                       116,912
Charlotte, N.C.                    134,912
Chattanooga, Tenn.                 131,041
Chicago, Illinois                3,620,062
Cincinnati, Ohio                   503,993
Cleveland, Ohio                    914,308
Columbus, Ohio                     375,901
Corpus Christi, Texas              108,485
Dallas, Texas                      434,462
Dayton, Ohio                       243,872
Denver, Colorado                   415,786
Des Moines, Iowa                   177,965
Detroit, Michigan                1,349,563
Duluth, Minnesota                  104,311
Elizabeth, New Jersey              112,317
El Paso, Texas                     130,485
Erie, Pa.                          130,303
Evansville, Indiana                128,636
Fall River, Mass.                  111,963
Flint, Michigan                    163,143
Fort Wayne, Indiana                133,607
Fort Worth, Texas                  278,778
Gary, Indiana                          ...
Grand Rapids, Mich.                    ...
Hartford, Conn.                        ...

(Table continued)






procedure in the grouping of quantitative data, we shall analyze the suicide 
rates of 107 American cities of 100,000 population and over for 1950. 

Table 3.1.1a presents first the actual number of suicides in each of the
107 major cities in the United States. It is evident that such absolute
numbers are not very revealing. Not unexpectedly, the larger cities have
a larger number of suicides, and the smaller cities have a smaller volume.
Clearly, the data are in a rather elementary stage of compilation.*

In order to infuse meaning into the raw numbers, it is necessary to relate
the number of suicides to the size of the population of the respective cities.
Only then will the differential tendency of the cities toward suicide become
apparent. This, of course, requires the calculation of a rate, which has
the effect of norming, or standardizing, all cities to the same population
size, in this instance, 100,000. We thereby eliminate the variable of population
size, which has now become a constant. Because of the small number
of suicides in relation to the general population, the rate is computed on
the basis of 100,000, thereby reducing the excessive decimal places, which
are difficult to read and less quickly grasped.
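
The computation of a rate may be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `rate_per_100k` is hypothetical, and the Houston figures are those listed in Table 3.1.1a.

```python
def rate_per_100k(events: int, population: int, base: int = 100_000) -> float:
    """Return the number of events per `base` inhabitants, to one decimal place."""
    if population <= 0:
        raise ValueError("population must be positive")
    return round(events / population * base, 1)


# Houston, as listed in Table 3.1.1a: 88 suicides in a population of 596,163
houston_rate = rate_per_100k(88, 596_163)
print(houston_rate)  # 14.8
```

Dividing by the population and multiplying by the common base is what turns population size from a variable into a constant.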

The individual suicide rates of these cities (Column 3) have already
been reduced to some order by an alphabetical arrangement, which is
useful for many purposes. But the alphabetical position of a city rests on
a quite arbitrary system which is totally unrelated to the magnitude of the
rates. It is therefore not a statistical order; it is merely a cataloguing device
for the ready location of the individual city in which one may be momentarily
interested. As far as a general impression of the pattern of distribution of
suicide rates is concerned, the cities might just as well be listed at random.

A first step in the direction of putting the materials in orderly form is
to arrange the rates in order of magnitude, from the lowest to highest, or
the reverse. This is called an array, and is comparable to lining up a
company of soldiers according to height, or sorting apples according to
size. We have now lost both the population size and separate identity
of the cities, and we are beginning to conceive of these values as ordered
variates, which is essentially a statistical conception.
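
As a small illustration, an array can be produced by sorting. The sketch below uses five rates taken from Table 3.1.1a; the variable names are hypothetical.

```python
# A few (city, rate) pairs taken from Table 3.1.1a, in alphabetical order
rates_by_city = {
    "Houston": 14.8,
    "Jersey City": 6.7,
    "Long Beach": 24.3,
    "Memphis": 6.6,
    "Niagara Falls": 2.6,
}

# The array keeps only the magnitudes, ordered from lowest to highest;
# the identity of each city is deliberately dropped.
array = sorted(rates_by_city.values())
print(array)  # [2.6, 6.6, 6.7, 14.8, 24.3]
```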

This new view of the data (Table 3.1.1b), afforded by the arrangement
in order of magnitude, renders a clearer picture of the concentration of
values in the middle of the array. Indeed, the differences between any
of these values and its neighbors are usually so slight that it would seem
pedantic to assign great significance to what seems to be mere chance
variation. No sociological distinctions between cities could be drawn from

* Although from the standpoint of grouping the data in this table are in a rather elementary
stage, actually we should realize that, from the standpoint of the process of collection
of data, they are already in an advanced stage of compilation. The original sources
are the death certificates, which are collected by the offices of vital statistics in the individual
cities. These have been assembled, summarized, and then reported to the state
and national offices of vital statistics, and finally published as descriptive tables by the
Federal Government. It is from these published tables that this material has been extracted.




Table 3.1.1a (Concluded)

                                    (1)           (2)         (3)
City                             Population    Suicides    Rate per
                                                            100,000

Sacramento, California             137,572         36        26.1
St. Louis, Missouri                856,796         90        10.5
St. Paul, Minn.                    311,349         30         9.6
Salt Lake City, Utah               182,121         23        12.6
San Antonio, Texas                 408,442         48        11.8
San Diego, Calif.                  334,387         70        21.0
San Francisco, Calif.              775,357        223        28.8
Savannah, Georgia                  119,633         20        16.7
Scranton, Pa.                      125,536          6         4.8
Seattle, Washington                467,591        124        26.5
Shreveport, La.                    127,206         11         8.6
Somerville, Mass.                  102,351          6         5.9
South Bend, Indiana                115,911         26        22.4
Spokane, Wash.                     161,721         27        16.7
Springfield, Mass.                     ...          8         4.9
Syracuse, New York                     ...         22        10.0
Tacoma, Washington                 143,673         39        27.1
Tampa, Florida                     124,681         31        24.9
Toledo, Ohio                       306,616         46        15.1
Trenton, New Jersey                128,009         24         ...
Tulsa, Oklahoma                    182,740        ...         ...
Utica, New York                    101,531         13        12.8
Washington, D.C.                   802,178        ...         ...
Waterbury, Conn.                   104,477        ...         ...
Wichita, Kansas                    163,279        ...         ...
Wilmington, Del.                   110,356        ...         ...
Worcester, Mass.                   203,486        ...         ...
Yonkers, New York                      ...        ...         ...
Youngstown, Ohio                   168,330         13         7.7


Grouping Procedure. Since age, weight, income, and various types of rates
are things of which there may be more or less, they are quantitative in
nature. Cities may vary in amount of suicide, crime, or divorce; individuals
...; marriages may vary in duration. ... random and chaotic; rather
... usually conforms to some specific model which will emerge only after
... individual items have been judiciously grouped. To illustrate the






the class intervals. In this respect, there are no rigid prescriptions, although
obviously the ungrouped array must be sufficiently compressed to
achieve the purpose of grouping. Broadly speaking, grouping should be
adapted (1) to the nature of the data, (2) to the objectives which the
classification intends to serve, and (3) to the background and needs of the
readers for whom it is intended. Thus, data embracing an immense range,
such as the ages of the United States population, would require a different
grouping than would the ages of American school children. The oversimplified
statistics in the popular brochures of insurance companies,
depicting the relative incidence of heart disease and cancer, may be very
instructive for the lay public, but inadequate for scientific uses. A grouping
of data intended for a professional audience could be more intricate
than one planned for popular consumption.

In spite of this flexibility, it is customary to lay down a few technical 
rules which could normally serve as a guide unless specific conditions dic- 
tate otherwise. 

(1) The number of class intervals should ordinarily be no more than 15,
nor less than 10. Fifteen intervals are usually sufficient to reveal the
pattern of distribution, and yet not so numerous that it cannot be readily
apprehended. On the other hand, if fewer than ten intervals are employed,
the salient features of the distribution may be obscured.

(2) The size of the interval should be a whole number, and, whenever
practicable, of convenient divisibility such as 2, 10, or 25. Additionally,
there may be some practical advantage attached to the use of multiples
of 10, which is consistent with our decimal system. With units in feet and
inches, minutes and hours, the requirement of adhering to the decimal
system may of course be waived.

(3) Intervals should, whenever possible, be of uniform width. They
are then most readily grasped, and greatly facilitate further computation.
However, the rule of equal intervals may at times have to give way in the
face of special circumstances. Thus, the school population may call for
age intervals of 6-13, 14-17, and 18-22; otherwise information essential
to the school administrator would be concealed. Similarly, in the classification
of cities by size of population, or families by income brackets,
uniformity of class intervals cannot be maintained. At the lower end of
the range, variates are numerous and differences are small, while in the
upper brackets, cases are infrequent and gaps are large. Hence, class
intervals should increase progressively in size throughout the distribution.
However, whenever unequal intervals are used, the larger intervals should
be multiples of the smaller ones, especially when graphs are contemplated.

(4) The ends of a frequency distribution may be either open or closed. 
Although the ends are normally closed, it is often convenient to leave one 
or both extremes open, as in the National Office of Vital Statistics tabula- 
tion of births by age of mother: 




Table 3.1.1b Array of Original Suicide Rates

 ...    6.7    8.6   10.0   11.1    ...   14.1   15.4   20.1
 ...    7.0    8.6   10.0   11.2   11.9   14.1   16.3    ...
 ...    7.0    8.6   10.0   11.2    ...   14.1    ...   21.1
 ...    7.1    8.9   10.4   11.2    ...    ...   16.7   22.4
 ...    7.5    9.0   10.4   11.2   12.4   14.5   16.7   23.4
 ...    ...    9.1    ...    ...    ...   14.6    ...   24.3
 ...    7.6    9.1   10.5    ...    ...   14.7   17.7    ...
 ...    7.7    9.6   10.6    ...   13.2    ...   18.2    ...
 ...    8.1    9.6    ...    ...   13.4   14.9    ...    ...
 ...    8.2    9.6   10.8   11.6    ...   15.1    ...    ...
 ...    8.2    9.6   10.8   11.6   13.6   15.1    ...    ...
 ...    8.4   10.0   10.9    ...    ...    ...   19.5    ...


the numerous slight variations existing within the list. Not only are such
differences meaningless, but they actually clutter our view of the underlying
pattern of variation. Therefore, why not slough them off? Rounding
the numbers would of course reduce the trivial differences between the
adjacent values (Table 3.1.1c).


Table 3.1.1c Array of Suicide Rates, Rounded to Nearest Whole Number

... sufficient to eliminate all the confusion which ... number of items.
Furthermore, if one assumes ... still more bulky and complex than this one,
... available to render this array comprehensible ... analyst. This device
consists in grouping identical or contiguous ... into convenient class
intervals. The form resulting from this procedure goes by the name of
frequency distribution.

... proceed to the actual tabulation, ... must first determine the
appropriate number and size of








there are no vacant intervals; and the outline of the distribution assumes
the shape of a curve with a single peak in the middle. The suicide rates
now lend themselves to a quick description as clustering around 11 or 12
per 100,000 population, with variations below and above this figure.

In this table, the values have been listed from low to high. This practice,
which is that of the United States Census and prevails generally in
the tabulation of social statistics, differs from the convention employed
by psychologists and educators who reverse the order of values. The
difference is, of course, superficial rather than essential. In the tabulation
of test scores and school marks it is perhaps more natural to begin with
the highest score, against which all other scores are judged. In general,
however, with such variables as age and income, one is likely to begin
counting from the zero origin, in conformity with habits acquired in childhood
and reinforced through daily use.

Any frequency tabulation may at first produce the impression that we
are becoming more and more inexact in an endeavor to condense the data
to a more manageable level. It is true, every condensation does some
violence to detail. However, in view of the fact that the foregoing suicide
rates are for a specific year, and that these rates in any event will vary
somewhat in the same city from year to year, there would be no purpose
in insisting on such exaggerated exactness. We have merely discarded
detail which is probably of no use anyway. Grouping is therefore not
only more convenient, but is fully justified and actually necessary in
order to display the fundamental pattern of events. If, in the interest of
maneuverability, however, grouping is too broad and coarse, grouping
error may occur which must later be taken into account in the interpretation
of the materials.

Rounded and True Limits.* The class limits of the frequency distribution
(Table 3.1.1d) have been designated by rounded numbers: 3-5, 6-8, and
so on. However, according to prevailing convention, a rounded number
is itself considered the midpoint of some definite interval: 3 is the midpoint
of the interval 2.5-3.5, and 5 is the midpoint of the interval 4.5-5.5.
It follows that the true limits of the first and successive class intervals
would be written as follows:

 2.5- 5.5
 5.5- 8.5
 8.5-11.5
 . . .
26.5-29.5

* Some employ the terms “written” and “true” limits ... Since “true” limits
can be, and frequently are, written, and since the “written” limits are actually
rounded limits, it would seem ... to call “rounded” values by name.




Age of Mother
Under 15 years
15-19 years
20-24 years
25-29 years
30-34 years
35-39 years
40-44 years
45 years and over

Such open ends are resorted to in order to avoid the necessity of itemizing 
small, irregular and essentially trivial frequencies which extend well beyond 
the limits of the significant range of the data. However, for technical 
reasons, it is a good rule to close the ends whenever possible. Otherwise 
subsequent calculations, such as the arithmetic average, are impossible, 
and graphing becomes awkward. 

Tabulation. In grouping the suicide rates, the first step is to divide the
range into an effective number of class intervals having a convenient width.
The range of the array extends from 3 to 29, a spread of 27 points. A little
exploration will reveal that an interval of five would yield too few groups
to provide the desired discrimination, while an interval of only one point
would obviously not produce the desired condensation. We therefore
select a 3-unit interval as the optimum. Then, starting with a multiple
of 3 as the lower limit, we write down the class limits and proceed with
the tally as shown in Table 3.1.1d.
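
The “little exploration” of candidate widths can be made explicit. The following is a minimal sketch, assuming whole-number rounded limits; the helper name `class_count` is hypothetical.

```python
import math


def class_count(lowest: int, highest: int, width: int) -> int:
    """Number of equal-width classes needed to cover lowest..highest inclusive."""
    spread = highest - lowest + 1  # 3 to 29 covers 27 whole-number points
    return math.ceil(spread / width)


for width in (1, 3, 5):
    print(width, class_count(3, 29, width))
# widths of 1, 3, and 5 give 27, 9, and 6 classes respectively
```

A width of one leaves the array uncondensed, a width of five yields only six groups, and a width of three yields nine.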

Table 3.1.1d Frequency Tally of Suicide Rates

Rounded
Class Limits     Tally                                   Frequency
                                                            (f)

  3- 5           ///// /                                      6
  6- 8           ///// ///// ///// ///                       18
  9-11           ///// ///// ///// ///// ///// ////          29
 12-14           ///// ///// ///// ///// ////                24
 15-17           ///// ///// ///                             13
 18-20           ///// //                                     7
 21-23           ////                                         4
 24-26           ////                                         4
 27-29           //                                           2

                                                        N = 107
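
The tallying itself may be sketched as follows. This is an illustration only: the function name `frequency_table` is hypothetical, the sample consists of fourteen rates from the concluding part of Table 3.1.1a (Sacramento through Spokane) rather than all 107 cities, and rounding relies on Python's built-in `round`, which sends exact halves to the even neighbor.

```python
from collections import Counter


def frequency_table(values, start=3, width=3, n_classes=9):
    """Tally values into classes with rounded limits start..start+width-1, and so on."""
    counts = Counter()
    for value in values:
        whole = round(value)  # nearest whole number; exact halves go to the even neighbor
        lower = start + ((whole - start) // width) * width
        counts[lower] += 1
    table = []
    for i in range(n_classes):
        lower = start + i * width
        # values falling outside the listed classes are not shown by this sketch
        table.append(((lower, lower + width - 1), counts[lower]))
    return table


# Fourteen rates from the concluding part of Table 3.1.1a
sample = [26.1, 10.5, 9.6, 12.6, 11.8, 21.0, 28.8, 16.7, 4.8, 26.5, 8.6, 5.9, 22.4, 16.7]

for (lower, upper), f in frequency_table(sample):
    print(f"{lower:>2}-{upper:<2}  {'/' * f:<5} {f}")
```

With so small a sample one interval (18-20) remains vacant, which is one reason the full set of 107 rates is needed before the pattern of the distribution emerges.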


This frequency distribution of suicide rates commends itself to common
statistical sense. The frequencies are neither too large nor too small;

higher interval. For example, observed 6.0 becomes “rounded” 6 and
hence joins the interval 6-8.

Midpoint of Class Interval. Since it is impossible to subject a class interval
to further calculation, it is often necessary to designate a single value for
all the items which are included within its boundaries. The selection of
this value must be guided by the principle of representativeness as well
as expediency. On the assumption that the items are more or less uniformly
distributed between the class boundaries, a point half way between
the true boundaries would satisfy our needs. Accordingly, we select the
midpoint of the true interval to represent all the values within the interval.
In effect, we round off all individual values to the interval midpoint.
But by that maneuver, we introduce grouping error, owing to the fact
that few if any of the observed items within the interval will fall exactly
at the midpoint. When items are evenly distributed, the errors produced
by arbitrary grouping will offset one another; however, when the items
are bunched in one end of the interval, the errors will not balance one
another and the cumulated grouping error will infect subsequent calculations
such as the mean. Hence, grouping error is defined as the amount
by which the average value of a set of grouped data diverges from the
average value of the same data in ungrouped form. When there is reason
to believe that grouping error is severe, correction formulas should be
applied so that averages derived from grouped data will more closely
approximate those of the original, ungrouped data.
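
Grouping error, as just defined, can be illustrated numerically. The sketch below is an assumption-laden example: the observations are hypothetical, deliberately bunched toward the lower end of each interval, and the helper `grouped_mean` assumes true limits of 2.5-5.5, 5.5-8.5, and so on.

```python
import math


def grouped_mean(values, true_lower=2.5, width=3.0):
    """Mean after replacing each value by the midpoint of its class interval."""
    midpoints = []
    for value in values:
        index = math.floor((value - true_lower) / width)  # which class the value falls in
        midpoints.append(true_lower + index * width + width / 2)
    return sum(midpoints) / len(midpoints)


# Hypothetical observations, bunched toward the lower end of each interval
observed = [3.0, 3.5, 4.0, 6.0, 6.5, 7.0, 8.0]

ungrouped = sum(observed) / len(observed)
grouping_error = grouped_mean(observed) - ungrouped
print(round(grouping_error, 3))  # 0.286
```

Because most items lie below their interval midpoints, the individual errors do not offset one another and the grouped mean overstates the ungrouped mean.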

To find the midpoint of any interval, we divide its true width by two, 
and add the result to the true lower limit. An equivalent arithmetic pro- 
cedure is to add directly the values of the true limits, and divide by two. 
The procedure is applicable irrespective of the manner of rounding or the 
type of data (whether continuous or discrete). To illustrate, we present 
in Table 3.1.2 a set of rounded limits (3-5, 6-8), whose true limits and 
midpoints vary, however, according to the prior rounding procedure.


Table 3.1.2 Determination of Interval Midpoint, Rounding to Nearest
Whole Number and Last Whole Number

Rounding         Rounded         True         Interval    One-Half       Midpoint
Procedure        Class Limits    Limits       Width       Class Width

Nearest whole       3-5          2.5-5.5         3           1.5            4
number              6-8          5.5-8.5         3           1.5            7

Last whole          3-5          3.0-6.0         3           1.5            4.5
number              6-8          6.0-9.0         3           1.5            7.5
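
The two equivalent procedures for locating the midpoint can be checked directly. A minimal sketch, with a hypothetical function name `midpoint` that takes the true limits as its arguments:

```python
def midpoint(true_lower: float, true_upper: float) -> float:
    """Midpoint of an interval: the lower true limit plus half the true width."""
    half_width = (true_upper - true_lower) / 2
    return true_lower + half_width


# The same rounded limits (3-5, 6-8) under the two rounding procedures
print(midpoint(2.5, 5.5), midpoint(5.5, 8.5))  # 4.0 7.0  (nearest whole number)
print(midpoint(3.0, 6.0), midpoint(6.0, 9.0))  # 4.5 7.5  (last whole number)

# The equivalent shortcut: add the true limits and divide by two
assert midpoint(2.5, 5.5) == (2.5 + 5.5) / 2
```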












It is evident that the true limits, which are of course spaceless points, constitute
the common boundaries between adjoining class intervals and
thereby assure the continuity of the scale, as in Figure 3.1.1.

[Figure 3.1.1 Rounded and True Limits. A scale marked with the rounded limits 3, 5, 6, 8, ..., 27, 29, 30 and with the true limits 2.5, 5.5, 8.5, ..., 26.5, 29.5.]

At first glance, the common boundaries might seem to confuse the task
of grouping. In what interval, for example, should we locate an observed
value of 5.5, which falls right on the boundary of contiguous class intervals?
The resolution of this dilemma has already been offered: rounding to the
nearest even value, we would place it in the interval 6-8.
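
The rule of rounding a boundary value to the nearest even number can be tried out as follows. This is a sketch, not a prescribed procedure: it uses Python's standard `decimal` module, and the function name `round_half_even` is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_EVEN


def round_half_even(value: str) -> int:
    """Round a decimal string to a whole number; exact halves go to the even neighbor."""
    return int(Decimal(value).quantize(Decimal("1"), rounding=ROUND_HALF_EVEN))


for observed in ("5.4", "5.5", "8.5", "11.5"):
    print(observed, "->", round_half_even(observed))
# 5.5 -> 6 joins the interval 6-8; 8.5 -> 8 also remains in 6-8; 11.5 -> 12 joins 12-14
```

Values are passed as strings so that a figure such as 5.5 is treated as an exact decimal rather than as a binary approximation.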

It will become increasingly apparent that the true limits are the only
limits with which we can do any statistical business. We may accordingly
raise the question: why carry along the rounded limits in the vocabulary
baggage? The answer is threefold: (1) rounded limits are easier to read,
whereas the more detailed but accurate true limits “clutter up” the table
visually; (2) rounded limits are adequate for tallying purposes; and
(3) they can always be transformed into true limits whenever necessary.

In order to reconstruct the true limits from the rounded, it is of course
necessary to know the prior rounding procedure: whether to the nearest
or the next lower unit, which is not evident from the outward appearance
of the rounded numbers. From the appearance of the following rounded
class limits, for example, it is not evident whether the suicide rates were
rounded to the nearest or last whole unit:

3- 5 
6- 8 
9-11 

Had they been rounded to the nearest whole number, the true limits would
be written:

2.5- 5.5

5.5- 8.5

8.5-11.5

But had they been rounded to the last whole unit, the true limits would
read:

3.0- 6.0

6.0- 9.0

9.0-12.0
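
The transformation of rounded limits into true limits under either procedure may be sketched as below. The function name `true_limits` and its `rounding` argument are hypothetical labels introduced for illustration.

```python
def true_limits(lower: int, upper: int, rounding: str = "nearest", unit: float = 1.0):
    """True limits of a class whose rounded limits are lower..upper."""
    if rounding == "nearest":
        # each rounded value is the midpoint of an interval one unit wide
        return lower - unit / 2, upper + unit / 2
    if rounding == "last":
        # each rounded value is the lower end of an interval one unit wide
        return float(lower), upper + unit
    raise ValueError("rounding must be 'nearest' or 'last'")


for lower, upper in [(3, 5), (6, 8), (9, 11)]:
    print(true_limits(lower, upper, "nearest"), true_limits(lower, upper, "last"))
# (2.5, 5.5) (3.0, 6.0)
# (5.5, 8.5) (6.0, 9.0)
# (8.5, 11.5) (9.0, 12.0)
```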

Where rounding is to the last unit, and where the observed value falls on
a boundary, “drop the zero” and thereby place the item in the next




Technique of Table Construction. We must always anticipate that statistical
materials will ultimately be presented in orderly tabular form to some
audience. In order to ensure maximum intelligibility and effectiveness in
display, it is necessary that such presentation conform to well-established
conventions that prescribe the structure and content of the table. As it
stands, Table 3.1.1d is still a working table, and not yet wholly presentable
to its public. It must still be refined in several respects: a substantive
title must be attached, the temporary tally marks removed, and clear
captions provided.

The title must convey in plain manner the content of the table; therefore,
the first term of the title should normally designate the subject matter
of the table. Then should follow as needed, in order of priority, an indication
of the classification of the materials, the geographical area and the
period of time to which they pertain. To express it patly: the elements
of the title should be arranged in order of the “what, how, where, and
when” of the tabular information.

Architecturally, the table consists of three parts: stub, caption, and body.
The stub, located in the left sector, where the Occidental eye usually looks
first, exhibits the primary classification of the materials. It should always
be identified by its own heading. The body, to the right of the stub, comprises
the frequencies, inserted in the cells created by the intersections of
the rows and columns. The row frequencies are identified by the stub,
while the column frequencies are designated by the caption, which consists
of the one or more secondary classifications with appropriate labels. To
illustrate this conventional arrangement of parts, we reproduce in Table
3.1.4 a typical United States Census table.


Table 3.1.4 School Enrollment Rates by Age, Selected Years, 1910-1957

Source: Donald J. Bogue, The Population of the United States. The Free
Press, Glencoe, Ill., 1959, p. 339 (adapted).


Frequencies. It is only by way of comparison with other distributions
that a given frequency distribution takes on meaning. But it is





Since the rounded values are indistinguishable for the two types of
rounding, care must be exercised in reading correctly the rounded value.
If, for example, rounded 3 is mistakenly read as 3.0-4.0, instead of 2.5-3.5,
the midpoint would be located one-half unit too high.

Discrete values follow the same rules of grouping as are applied to continuous
data, except for a minor technical adaptation. Since a discrete
variable jumps from one integer to the next instead of moving continuously
over a specified range, a discrete variate is and remains a whole number.
In certain situations, however, we must make the factitious assumption
that a discrete variable is after all continuous and that the observed values
have been rounded to the nearest whole. True limits and class midpoints
calculated on this assumption are illustrated in Table 3.1.3.

Table 3.1.3 Number of Dwelling Units per Block, Los Angeles, 1910




Table 3.1.6 Cumulative Age Distribution, U.S. Population, 1950

Age*             Per Cent    “Less Than”                  “Or More”
                             Cumulative Per Cent          Cumulative Per Cent

 0-4               10.7       10.7 (less than 5)          100.0 (0 or more)
 5-9                8.8       19.5 (less than 10)          89.3 (5 or more)
10-14               7.4       26.9                         80.5
15-19               7.0       33.9                         73.1
20-24               7.6       41.5                         66.1
25-29               8.1       49.6                         58.5
30-34               7.6       57.2                         50.4
35-39               ...        ...                         42.8
40-44               ...        ...                          ...
45-49               ...        ...                          ...
50-54               ...       83.0                          ...
55-59               4.8       87.8                         17.0
60-64               4.0       91.8                         12.2
65-69               3.3       95.1                          8.2
70-74               2.3       97.4                          4.9
75 and over         2.6      100.0 (less than 100)          2.6 (75 or more)

* Age rounded to last whole year.

Source: U.S. Bureau of the Census, U.S. Census of Population: 1950, Vol. II,
Part 1, United States Summary, U.S. Government Printing Office,
Washington, D.C., 1953.

population is less than 5 years of age; to find what percentage of the population
is less than 10 years of age, we cumulate the first two class frequencies:
10.7% + 8.8% = 19.5%. All successive cumulative frequencies are similarly
obtained. The highest interval, however, offers a slight difficulty,
since it has no specified upper limit; it is “open ended.” In fact, the
United States Census finds the scatter of items in the higher ages so thin
that it has already cumulated the last several intervals on an “or more”
basis. To provide an upper limit for the “less than” cumulation, it is
permissible in this instance to close the interval arbitrarily at 100 years.

The “or more” cumulation is read from the true lower limit of the interval.
Thus, 2.6 per cent of the population is 75 or more. In order to
obtain the per cent of population 70 or more, we cumulate the frequencies
of the two highest intervals: 2.6% + 2.3% = 4.9%; and so on to the 100
per cent cumulation.
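
Both cumulations can be sketched with a running sum. For the sake of a complete example the sketch applies the procedure to the class frequencies of Table 3.1.1d rather than to the age distribution; that choice, and the variable names, are assumptions made for illustration.

```python
from itertools import accumulate

# Class frequencies of Table 3.1.1d, lowest class (3-5) first
frequencies = [6, 18, 29, 24, 13, 7, 4, 4, 2]

# "Less than" cumulation: read at the true upper limit of each interval
less_than = list(accumulate(frequencies))

# "Or more" cumulation: read at the true lower limit of each interval
or_more = list(accumulate(reversed(frequencies)))[::-1]

print(less_than)  # [6, 24, 53, 77, 90, 97, 101, 105, 107]
print(or_more)    # [107, 101, 83, 54, 30, 17, 10, 6, 2]
```

The last “less than” figure and the first “or more” figure both equal the total N, which serves as a check on the cumulation.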

From these two tables, any number of useful readings may be made.
8.2 per cent of the population is 65 years and over; 49.6 per cent of the
population is under 30 years of age. Comparable questions could be
answered from the appropriate tabulations: what proportion of families
have incomes of $10,000 and over? what proportion of families have six
children? what proportion of unemployed persons have been
out of work for 6 months or more?










difficult to compare two or more different distributions that have widely
varying totals. To norm them, or make them comparable, the absolute
distributions must be reduced to relative, or percentage distributions.
The conversion of an absolute to a percentage frequency is accomplished
by dividing the class frequency by the total, and multiplying the result
by 100, a familiar arithmetic procedure. Table 3.1.5 illustrates the
convenience of this method of comparison.


Table 3.1.5 U.S. Population, by Age, 1880 and 1950

Source: ... 1950, Vol. II, Characteristics of the Population, ... Summary,
U.S. Government Printing Office.
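
The conversion to a percentage distribution may be sketched as follows. The frequencies are those of Table 3.1.1d, used here only as a convenient example; the function name `percentage_distribution` is hypothetical.

```python
def percentage_distribution(frequencies):
    """Divide each class frequency by the total and multiply by 100."""
    total = sum(frequencies)
    return [f / total * 100 for f in frequencies]


# Class frequencies of Table 3.1.1d (N = 107)
frequencies = [6, 18, 29, 24, 13, 7, 4, 4, 2]
percents = percentage_distribution(frequencies)
print([round(p, 1) for p in percents])
# [5.6, 16.8, 27.1, 22.4, 12.1, 6.5, 3.7, 3.7, 1.9]
```

The unrounded per cents sum to 100; after rounding to one decimal place the sum may differ slightly (here 99.8), which is a rounding effect and not an error of computation.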

Cumulative Distribution. At times we may simply want to know the
number or percentage of items above or below a certain cutting point on
the scale. A welfare administrator, for example, will want to know how
many persons are 65 years or older. Such results may be readily obtained
from the cumulative frequency distribution, which is derived from the simple
frequency distribution by a process of merging successive class intervals
until all have been cumulated. Such a procedure reveals the frequency of
... boundary, and thus permits the cutting ... convenient position according
to the interests and needs of the reader. Table 3.1.6, a work table,
demonstrates how ... tabulation is derived from the simple serial table,
... either the lowest or highest ... conventionally ... cumulative frequency
distribution ... per cent of population below the true upper limit of each
class interval. Thus, 10.7 per cent of the








Table 3.1.7 Delinquency Rates, 140 Local Areas, Chicago

 ...    1.6    ...    ...    ...    ...    0.6    ...    1.9    2.9
 3.7    4.9    5.1    0.6    1.5    ...    4.6    ...    2.7    1.7
 2.1    2.6    0.9    ...    7.4    ...    1.5    2.2    1.0    2.4
 3.0    ...    ...    2.6    3.7    0.7    1.7    1.9    ...    8.8
 4.0    ...    ...    2.2    4.4   12.1   18.9    ...    3.9    1.1
 2.2    2.9    ...    ...    9.4    0.6    ...    2.3    3.4    ...
 2.7    2.4    1.9    ...    ...    4.8    5.0    ...    ...    4.6
 1.4    1.6    6.1    5.7    7.0    ...    ...    1.9    1.2    ...
11.9   13.4    ...    1.3    ...    2.1    2.7    ...    ...    ...
 ...    ...    1.9    3.1    ...    4.5    2.5    6.0    1.1    2.2
 5.2    ...    2.7    3.9    ...    0.9    2.9    9.0    2.1    3.4
 ...    ...    0.7    ...   11.4    5.0    ...    ...    2.0    1.6
 4.2    ...    5.1    9.4    0.6    ...    3.1    3.1   12.1    3.7
 ...    ...    2.1    2.1    1.9    2.7    ...    1.6    3.0    ...

Source: Clifford R. Shaw and Henry D. McKay, Juvenile Delinquency and Urban Areas, The
University of Chicago Press, Chicago, 1942, p. 53.


12. Calculate the width and midpoint of each of the following inten’als, assuming 
that data are rounded to (a) the nearest unit and (b) to the next lower unit: 


6- 8 2-5 0-4 
6-11 6-9 5-9 
12-14 10-13 10-14 


13. From the given class limits and midpoints, determine:
(a) whether class limits are true or rounded
(b) the size of the class interval
(c) the manner of rounding, if any.

Limits         Midpoint        Limits        Midpoint

20-24            22.5           0-9             4.5
25-29            27.5          10-19           14.5
14.5-19.5        17.0          .5-2.4           1.45
19.5-24.5        22.0         2.5-4.4           3.45
 6- 8             7.0
 9-11            10.0






Grouping of Data


Questions and Problems

1. Define the following concepts; 

Array 

Quantitative Grouping 

Frequency Distribution 

Class Interval 

Class Width 

Rounded limits 

True Limits 

Class Midpoint 

Grouping Error 

Tabulation 

Stub 

Caption 

Body 

Per cent Frequency
Cumulative Frequency Distribution
Cumulative Frequency
"Less Than" Cumulation
"Or More" Cumulation

2. (a) State considerations governing the choice of the class interval.

(b) Why is it good practice to make class intervals equal in size?

3. (a) When are unequal class intervals appropriate? Give several examples.

(b) State considerations governing the construction of unequal intervals.

4. (a) When are small class intervals appropriate?

(b) Illustrate and discuss large class intervals.

(c) Are class intervals intrinsically small or large?

5. Must the lowest class interval in every tabulation begin with zero?

6. Distinguish between rounded and true class limits.

7. Is the distinction between rounded and true class limits applicable to discrete
data? Explain.

8. Give two illustrations of groupings "adapted to the nature of the data."

9. How do you determine whether there are too many (few) intervals?

10. (a) How do you recognize whether class limits are true or rounded?

(b) From the rounded limits is it possible to determine the manner of rounding?
Explain.

11. The delinquency rates listed below are for 140 local areas in Chicago (Table
3.1.7). Each variate is the number of boys per 100 making a court appearance.
fjO, 

(b) Construct a frequency table: rounded lower limit, 0; class width, 2.

(c) Write the true class limits.

(d) Convert class frequencies to per cents, which should sum to 100.


16. Construct an "or more" cumulative frequency distribution from Table 3.1.9.


Table 3.1.9  Distribution of Families by Size, U.S., 1950

Size of Family        Per Cent

2 persons               32.5
3    "                  22.8
4    "                  20.9
5    "                  12.3
6    "                   5.9
7 or more                5.6

Total                  100.0

Source: Current Population Reports, Population Characteristics, Series P-20, No. 75, U.S.
Government Printing Office, Washington, D.C., June 9, 1957.


17. (a) Express the number of Indians in each age group (Table 3.1.10) as a
percentage of the total.

(b) Combine percentage frequencies as necessary to compare the age distribu-
tion of Indians in Table 3.1.10 with the 1950 United States age distribution
of Table 3.1.5.


Table 3.1.10  Age Distribution, American Indians, U.S., 1950

Age                Number (in thousands)

 0- 4                    52
 5- 9                    44
10-14                    44
15-19                    34
20-24                    30
25-29                    23
30-34                    20
35-39                    19
40-44                    15
45-49
50-54
55-59
60-64
65-69
70-74
75 and over

Total                   312

Source: U.S. Bureau of ... Vol. IV ... D.C., 1953, p. 17.








§ 3.2 Qualitative Variables


tribution of the variable, analogous to the frequency distribution of the 
variates of the quantitative variable. 

An example is provided in Table 3.2.1, where the population of the 
United States is grouped according to the qualitative variable of religion, 
whose attributes are the respective church denominations. 

Compared to the quantitative tabulation, this type may appear rela-
tively uncomplicated. Absent are class intervals, midpoints, rounded and
true limits; there is no dichotomy of continuous and discrete data; there
is no natural, inevitable order or array of the attributes, and therefore they
cannot be cumulated. It is merely an enumeration of the frequencies of
the respective attributes. But in spite of this apparent simplicity, these
tabular classifications are also bound by certain rules that are designed
to carry out the purposes of all classification. These purposes are: to
organize a body of facts as completely as necessary and as unambiguously
as possible; to set forth clearly the attributes and their relative frequencies,
or weights. To achieve these ends fully, it must be (1) possible to classify
every item, and (2) impossible to classify an item in more than one cate-
gory.
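The two requirements just stated can be checked mechanically. The following is a minimal sketch in Python, not a procedure taken from the text: the function name `classify`, the marital-status codes, and the category scheme are all hypothetical and serve only to illustrate the idea. Each item is tested against every category; an item that fits none shows that the scheme is not exhaustive, and an item that fits more than one shows that the categories overlap.

```python
from collections import Counter

def classify(items, categories):
    """Tally items under a classification given as {label: test function}.

    Returns (counts, unclassified, multiply_classified).
    """
    counts = Counter()
    unclassified, multiple = [], []
    for item in items:
        hits = [label for label, test in categories.items() if test(item)]
        if not hits:
            unclassified.append(item)   # no category fits: scheme not exhaustive
        elif len(hits) > 1:
            multiple.append(item)       # several categories fit: they overlap
        else:
            counts[hits[0]] += 1
    return counts, unclassified, multiple

# Hypothetical marital-status codes, for illustration only.
people = ["single", "married", "widowed", "married", "divorced", None]

scheme = {
    "Single": lambda s: s == "single",
    "Married": lambda s: s == "married",
    "Widowed or divorced": lambda s: s in ("widowed", "divorced"),
    "Unknown": lambda s: s is None,     # residual category
}

counts, unclassified, multiple = classify(people, scheme)
total = sum(counts.values())
percents = {label: 100 * n / total for label, n in counts.items()}
```

With the residual "Unknown" category present, every item is placed exactly once and the percentages sum to 100. If that category is dropped, the item with no recorded status is returned in `unclassified`.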

Principles of Classification. The first rule, sometimes labeled the principle
of inclusiveness, requires that the classification must contain all categories
necessary to accommodate every item. It must exhaust the data so that
the frequencies add up to 100 per cent. Any omission of items would have
the effect of distorting the relative weights of the respective attributes,
thereby producing misleading impressions. Hence, if for any reason some
items remain unclassifiable (their identity may be unknown, or perhaps,
in the interest of brevity, not required), they must still be accounted for
in residual categories of "unknown" or "all other," so that the ratios
among subgroup frequencies are preserved. This procedure is illustrated
in Table 3.2.1.

The second rule demands that all items be subsumed under a single well-
defined criterion, which is the subject of the table. Otherwise expressed,
all items must fit one variable exclusively, and each item must fit only
one attribute. When these conditions are satisfied, it will be impossible
to classify an item in more than one category. This is known as the
principle of exclusiveness. But strict compliance with this principle will
be impossible when (a) two or more variables are improperly mixed within
a "single" classification, or when (b) categories overlap one another,
that is, when they are not mutually exclusive.

Common Violations of Principles: (1) Mixed Variables. The classification
of "races" as White, Negro, Indian, Mexican, and Jewish, occasionally
met with in less rigorous descriptions of American population, is defective
in that the categories do not possess a common denominator. There are


18. Construct an “or more” cumulative distribution for the absolute frequencies 
given in Table 3.1.11. 


Table 3.1.11  Distribution of Families by Income, U.S., 1955

Source: U.S. Bureau of the Census, Current Population Reports, Consumer Income, Series
P-60, No. 24, U.S. Government Printing Office, Washington, D.C., April, 1957.

19. Review the computation of an arithmetic rate (Table 3.1.1a). Express the
following absolute counts as rates:

(a) Community population, 28,362; number of births, 672. Calculate the birth
rate per 1,000 general population.

(b) Community population, 600,212; number of suicides, 50. Calculate the
suicide rate per 100,000 general population.

(c) 17,253 marriages; 4,C99 divorces. Calculate the number of divorces per
100 marriages.

20. Write the true limits and class midpoints for Table 3.1.3 (distribution of blocks
by number of dwelling units, Los Angeles).


Family Income            Number (in thousands)

Under $1,000                   3,300
$ 1,000- 1,999                 4,200
  2,000- 2,999                 4,700
  3,000- 3,999                 6,500
  4,000- 4,999                 6,600
  5,000- 5,999                 5,400
  6,000- 6,999                 4,100
  7,000- 9,999                 5,500
 10,000-14,999                 2,100
 15,000 and over                 600

Total                         42,800


Section Two 

Qualitative Variables 

If a given descriptive term cannot be graded on a scale of more or less,
higher or lower, or smaller or larger, we designate it as qualitative, to dis-
tinguish it from the quantitative categories set forth in the previous section.
Thus, sex is classifiable not in terms of more or less, but rather as answers
to the questions: "which one?" or "what kind?" Other qualitative
variables, e.g., nationality, religious affiliation, marital status, crime,
and occupation, are also classifiable into their several attributes.
The frequencies of the respective attributes constitute the frequency dis-


responses reducible to several variables. As Zeisel notes, "some answers
refer to the qualities of the cream, some to how the respondent became
acquainted with the cream, and some indicate the respondent's special
needs." In general, unless the attributes are reducible to a common de-
nominator, it is impossible to maintain their mutual exclusiveness, and
the comparison of their weights will be, to that extent, vitiated. A more
appropriate procedure would be to tabulate separately each variable by
its attributes, and thereby display the relative importance of reasons of
the same essential kind. Thus, it is meaningful to compare the weights
of alternative sources of information (Table 3.2.2b), but not to compare
quantitatively a given source of information with a quality of the cream.


Table 3.2.2b  Responses by Source of Information, American Women

Source                                   Per Cent

Recommendation                              27%
Heard it advertised over radio              17
Saw it on the counter                       14
No answer in this category                  42

Total                                      100%


(2) Overlapping Categories. In the foregoing example, the confusion
was produced by mixing distinct variables. However, in Table 3.2.3, the
disorder stems more from overlapping categories: the tabulated reasons
for "regret at failure to marry" are not mutually exclusive within the
same variable. In fact, in so far as they are not virtually synonymous,
some are already implied, or embraced, by others. The first item is actually
a synthesis of many of the remaining categories; it is virtually impossible
to distinguish at all between No. 3 and Nos. 5, 10, 11, 12, and 15.

Now, the reader is almost bound to compare the percentages with one
another, irrespective of the author's intentions. However, these per-
centages are not strictly comparable; Item 1 is not three times the sig-
nificance of Item 7, since an unknown number of elements in Item 7 are
already included in Item 1, thereby inflating the magnitude of Item 1 in
comparison to Item 7. It is as though a group of individuals were given
the opportunity to vote their preference, with the following results:


        Apples            47%
        Jonathans         25
        Winesaps          18
        Grimes Golden     10

                         100%

No one would declare that Apples were approximately five times as popular 
as Grimes Golden. Because of the substantive overlap, the weights of the 


Table 3.2.1  Adult Population by Religion Reported, Sample U.S., 1957

Religion                       Number          Per Cent

Protestant                   78,952,000          66.2%
   Baptist                   23,525,000          19.7
   Lutheran                   8,417,000           7.1
   Methodist                 16,676,000          14.0
   Presbyterian               6,656,000           5.6
   Other Protestant          23,678,000          19.8
Roman Catholic               30,669,000          25.7
Jewish                        3,868,000           3.2
Other religion                1,545,000           1.3
No religion                   3,195,000           2.7
Religion not reported         1,104,000           0.9

Total                       119,333,000         100.0%

Source: U.S. Bureau of the Census, Current Population Reports, Series P-20, No. 79, U.S.
Government Printing Office, Washington, D.C., 1958.


at least two dimensions which compete for the assignment of items: the
physical race and the cultural nationality. Many Mexicans, and nearly
all Jews, are Caucasoids; other Mexicans are Indians, or Mongoloids.
Similarly, Ferri's famous, but now obsolete classification of criminals as
Insane, Born Criminal, Habitual, Occasional, and Criminal by Passion is
even more difficult to apply. Biologic, personality, and cultural variables
are indiscriminately mixed in the same classification.

Another example of mixed variables is provided by Table 3.2.2a, in
which reasons for buying face cream are set forth along with the percentage
of women mentioning each one. While this contrived tabulation may
appear to be free from faults, actually it is a collection of miscellaneous


Table 3.2.2a  Reasons Given for Buying Face Cream, S50 American Women

Reason                                    Per Cent

Recommendation                               27%
Beneficial to skin                           20
Heard it advertised over radio               17
Saw it on the counter                        14
Reasonably priced                             9
Scent appealed                                7
Because of special skin condition             6

Total                                       100%

Source: Hans Zeisel, Say It with Figures, 4th ed., Harper & Brothers, New York, 1957,
p. l«5 (adapted).















Table 3.2.3  Reasons for Regret at Failure to Marry, Female College Graduates, 1928

Reason                                                    Number    Per Cent

 1. Belief marriage normal life for woman                   167       31.4%
 2. Desire for children
 3. Desire for husband
 4. Desire for husband and children
 5. Desire for home
 6. Desire for home and husband
 7. Desire for home and children                             67       10.7
 8. Desire for home and family life                           2
 9. Desire for husband, home, and children                   33
10. Believes would be happier married                        14
11. Desire for physical relationship                         14
12. Desire for married life                                   5
13. Regret having missed a richer, fuller experience         28
14. Have domestic tastes                                      6
15. Dread of loneliness                                      12
16. Regret not having found the right man                     0
17. Reason not given by others                               21

Total

Source: Katharine Bement Davis, "Why They Failed to Marry," Harper's, CLVI, 1928,
p. ACcB (adapted).


tabulated percentages do not reliably reflect the relative importance of
the professed choices. In general, whenever the respondents have not
been provided with a discriminating schedule to express themselves in an
unambiguous way, we must provide major and minor subheads as exempli-
fied in Table 3.2.1, which distribute the detailed characteristics as sub-
totals in italics.

Neither one of the last two tables is therefore a table in the model sense.
They are only a listing of one-way percentages, to be read across the rows
rather than vertically, since vertical comparisons are meaningless. If it is
agreed that the intention of a table is to set forth relative weights, it is
absolutely essential that categories fit one variable, and that they be
mutually exclusive.

(3) Multiple Entries. When we tolerate mixed variables or overlapping
attributes, there will result the tempting opportunity to make more than
one entry per case. This will, of course, result in a larger number of
entries than there are respondents, with consequent still greater confu-
sion in relative weights of attributes and uncertainty in interpretation.
In a study of social class, persons were asked: "In deciding whether a
person belongs to your class or not, which of these things is most important

to know: who his family is; how much money he has; what sort of educa-
tion he has; or how he believes and feels about certain things?" Their
replies were summarized as shown in Table 3.2.4.

The layout of this table gives every appearance of being a conventional
frequency distribution on one specific variable. The format entices the

Table 3.2.4  Criteria for Social Class Membership, Sample of U.S. Population

* Percentages add to more than 100%. People often gave more than one answer.

Source: Richard Centers, The Psychology of Social Classes, Princeton University Press,
Princeton, N.J., 1949, p. 91.



reader to read down the column and spontaneously to add the percentages,
which do not sum to 100, but rather add up to almost 130. In fact, in a
footnote, the author warns against obeying that impulse, since an unknown
number of persons were tabulated in more than one category. However,
that device nullifies the very purpose of the table: namely, to establish
the relative importance of the various class criteria. The author treats
the tabulation as proof that subjective beliefs, rather than objective cir-
cumstances such as money, family, and education, are the predominant
earmark of social class. Multiple entries, however, introduce an element
of confusion which disallows such a pat conclusion, and this for two
reasons: (1) plural votes are accorded equal weight, which they almost
certainly do not deserve; and (2) the uncontrolled number of entries which
were permitted each respondent may reflect the imagination and loquacity
of the respondents as much as their personal convictions. In the extreme
case, if all the respondents had voted for all alternatives, there would have
been no differentiation at all.
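The arithmetic behind a column that adds to more than 100 can be made concrete. The sketch below is in Python and uses invented answers from five imaginary respondents; they are not the data of Table 3.2.4, and the variable names are hypothetical. It computes each criterion's percentage on two different bases: the number of respondents and the number of mentions.

```python
from collections import Counter

# Hypothetical answers: each respondent may name several criteria.
answers = [
    {"beliefs", "money"},
    {"beliefs"},
    {"family", "education", "beliefs"},
    {"money"},
    {"education", "beliefs"},
]

mentions = Counter(c for named in answers for c in named)
n_respondents = len(answers)
n_mentions = sum(mentions.values())

# Per cent of respondents naming each criterion: need not sum to 100.
pct_of_respondents = {c: 100 * k / n_respondents for c, k in mentions.items()}

# Per cent of all mentions: sums to 100, but a respondent who names
# three criteria carries three times the weight of one who names one.
pct_of_mentions = {c: 100 * k / n_mentions for c, k in mentions.items()}
```

On the respondent base the column here sums to 180; on the mention base it sums to 100, but only by letting the more talkative respondents count for more. Neither base removes the difficulty described above, which is why a single "best" response or a ranking is the usual remedy.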

To avoid such a dilemma, most investigators prudently restrict the
answers to the "one best response," or tabulate the responses by prefer-
ential order. If this had been done, it is conceivable that the criterion of
"Beliefs and Attitudes," which was mentioned by nearly half the group,
and hence collectively takes first place, might rarely or never have been
given first place by the separate individuals. Thus, the tabulation of the
"one best response" could have reduced, or even wiped out, the attribute
which now stands at the head of the column. Of course, if for any reason
multiple responses are wanted, then it is incumbent on the research designer



to identify and tabulate them accordingly so that clear communication is
assured.

The Concept of a Table. It is clear that there is more than mere "shape"
to a table, and that its construction requires more than routine skill in
layout and spatial arrangement. A table is much more than a mere listing;
it is an organization of the data. A well-built table constitutes the pith
and meaning of a study in its definitive form. Consequently, serious con-
sideration of its form and content should not be postponed to the final
stages of a study, when the tabular report becomes due. To assure advance
consideration of tabular requirements, the preparation of dummy tables
is recommended. We must reconcile ourselves to the fact that the ultimate
reader may lift the table out of context for purposes of analysis and quo-
tation; it must therefore be as neatly self-contained and autonomous as
possible.

Adoption of principles of sound table construction, furthermore, will
enforce sound conceptual practice. The anticipation of rigorous tabulation
will promote rigorous research design; on the other hand, habits of flabby
table construction will obstruct critical and careful planning.

It is the qualitative variable that is particularly susceptible to equivocal
treatment and interpretation, since concepts used in the investigation of
attitudes, motives, and preferences, for example, are not standardized
and usually have their origins in common sense. Such is typically not the
case with concepts covering quantitative phenomena like income and age,
which are automatically reducible to mutually exclusive variates. In an
age distribution, it is impossible for a person to occupy two places at the
same time in the array.

As for the reader of the table, he too should approach the table critically.
A useful technique is to seek out the conspicuous item, study the relative
magnitudes or frequencies, and analyze the elements in the title in relation
to the content of the table. In this manner, the essentials of the table
that are relevant to the purposes of the reader are easily extracted.

Questions and Problems
1. Define the following concepts:

Qualitative Grouping
Principle of Inclusiveness
Principle of Exclusiveness
Mixed Variables
Overlapping Categories
Multiple Entries

2. (a) Why is it reasonable to list attributes according to magnitude of fre-
quency?

(b) Cite instances when this rule may not be observed.

3. Criticize the following mock table:












Table 3.2.6  Most Important Reason for Having Last Child, Husbands and Wives, Indianapolis, 1950

Reason Rated Most Important                                          Wives   Husbands

A strong liking for children                                          667      593
A belief that it is a religious duty to have a family
The traditional belief that married couples ought to have children    123
A feeling that it is important to carry on the family name              8
A desire to see what own children would be like                        6S
A feeling that children bring husband and wife closer together        147      244
Not wanting an only child                                             167      131
Not to be left childless in case of death of only child                14
The desire of children for more brothers and sisters                   33       21
Wanting a girl if only had boys, or a boy if only had girls            71       75
Unknown                                                                27

Totals

Source: Ronald Freedman and P. K. Whelpton, "Social and Psychological Factors Affecting
Fertility," Milbank Quarterly, July 1950, p. 430 (adapted).

such purposes bivariate, or even multivariate, classifications are required
in place of the simpler univariate tabulation. Cross-classification, which
is here broached, permeates all statistical analysis; hence, it will be fre-
quently encountered in one guise or another throughout this text.

The distribution of suicide rates according to magnitude supplies an
important breakdown of the data, but it also raises new questions such as:
why are the rates in some cities higher than in others? why do the rates
mass around 12 per 100,000 and thin out on the extremes? These questions
assume the existence of factors that determine the differential magnitudes
of the rates, and challenge the statistical investigator to uncover them.
For purposes of unveiling such relationships, the statistical method offers
a wide variety of techniques of varying complexity, one of the simplest of
which is the cross-tabulation. This procedure simultaneously groups a set
of items by two or more criteria instead of only one. The resulting fre-
quencies, called joint frequencies because they record the number of joint
occurrences, give clues to how the variables may be linked, and they
may often be more effectively employed than more advanced, complex
correlational techniques.
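A cross-tabulation of this kind amounts to counting pairs. As a minimal sketch in Python, assuming each item is recorded as a (rate interval, region) pair, the joint frequencies and both sets of marginal totals can be tallied as follows. The eight records are invented for illustration and are not drawn from the 107 cities.

```python
from collections import Counter

# Hypothetical records: (suicide-rate interval, region) for a few cities.
cities = [
    ("9-11", "N. East"), ("9-11", "South"), ("12-14", "N. Central"),
    ("18-20", "West"), ("21-23", "West"), ("9-11", "N. East"),
    ("12-14", "South"), ("18-20", "West"),
]

# Joint frequencies: one count per (interval, region) cell.
joint = Counter(cities)

# Marginal totals: the same items grouped by one criterion at a time.
by_rate = Counter(rate for rate, _ in cities)
by_region = Counter(region for _, region in cities)
```

Each cell of `joint` is a joint frequency in the sense defined above; summing the cells of a row or column reproduces the univariate distribution of that variable.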

Types of Cross-Classification and Their Interpretation. In Table 3.3.1a, the
107 cities in Table 3.1.1a have now been cross-tabulated by size of suicide
rate and geographical region on the plausible supposition that regional






§ 3.3 Cross-Classification


differences may at least partially account for the variations observed in 
the univariate distribution. It is evident from the percentage distributions 
(Table 3.3.1b) that the cities of the Northeast, South, and North Central 
regions do not differ markedly from one another. However, the cities of 
the West exhibit a very deviant pattern: all fifteen of the western cities, 
with one exception (in the interval 12-14), are in the upper half of the 
distribution. They are conspicuously higher than the cities of the 
other regions. Apparently, there are social and demographic conditions 
in the western cities that increase the incidence of suicide. We would 
logically turn to the characteristics of these cities in the effort to account 
for the variation in rates. The cross-classification has served its preliminary 
purpose. 
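The conversion of joint frequencies to percentage distributions by region can be sketched in a few lines of Python. The counts below are those of Table 3.3.1a, with the two blank cells in the 27-29 row read as zero, which is an assumption but the only reading consistent with the column totals of 32. The names `counts`, `column_percents`, and `high_share` are hypothetical.

```python
intervals = ["3-5", "6-8", "9-11", "12-14", "15-17",
             "18-20", "21-23", "24-26", "27-29"]

# Joint frequencies from Table 3.3.1a, one list per region,
# in the order of the intervals above.
counts = {
    "N. East":    [4, 6, 11, 8, 1, 2, 0, 0, 0],
    "South":      [2, 9, 9, 7, 3, 1, 0, 1, 0],
    "N. Central": [0, 3, 9, 8, 7, 0, 1, 0, 0],
    "West":       [0, 0, 0, 1, 2, 4, 3, 3, 2],
}

def column_percents(column):
    """Express each frequency as a percentage of its column total."""
    total = sum(column)
    return [round(100 * n / total, 1) for n in column]

percents = {region: column_percents(col) for region, col in counts.items()}

# Share of each region's cities with a rate of 15 or more (index 4 onward).
high_share = {r: 100 * sum(col[4:]) / sum(col) for r, col in counts.items()}
```

Percentaging within regions removes the effect of the unequal number of cities per region (32, 32, 28, and 15), so that the western pattern can be compared directly with the others: 14 of the 15 western cities fall at 15 or above, against 3 of 32 in the Northeast.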

The foregoing table was made up of one quantitative and one qualitative 
variable; but two qualitative variables may be cross-tabulated in the same 

Table 3.3.1a  Suicide Rates by Magnitude and Region, 107 American Cities

Suicide                        Region
Rate        N. East     South     N. Central     West

 3- 5          4          2           0            0
 6- 8          6          9           3            0
 9-11         11          9           9            0
12-14          8          7           8            1
15-17          1          3           7            2
18-20          2          1           0            4
21-23          0          0           1            3
24-26          0          1           0            3
27-29          0          0           0            2

Total         32         32          28           15

Source: U.S. Bureau of the Census, U.S. Census of the Population: 1950, Vol. II, General
Characteristics, Part I, U.S. Government Printing Office, Washington, D.C., 1953.


way. Thus, marriages may be cross-classified by religion of husband and
of wife in order to ascertain the extent of marriages within the faith (Table
3.3.2a). In the illustrated sample of 437 marriages, 393 are of a common
faith. The relative degree of religious homogamy is better expressed in
Table 3.3.2b, where cell frequencies have been converted to percentages
of the grand total. Summing the diagonal percentages, left to right, we
obtain 92 per cent, which neatly summarizes the extent of religious en-
dogamy in this sample.

But the apparent simplicity of a percentage statement should not be
permitted to obscure its hazards. If the base total (N) is not given,



we have no means of knowing whether the several percentages rest on
25, 50, or 500 cases, and to that extent therefore cannot judge their de-
pendability. Percentages based on small absolute numbers are of doubtful
reliability. For this reason it is good practice to include the total (N)
with every percentage distribution, so that actual frequencies can be recon-
structed. In addition, actual frequencies may be recombined, whereas
percentages on different bases may not be so manipulated.
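Why percentages on different bases cannot simply be averaged is easiest to see by going back to the frequencies. The Python sketch below uses invented figures, and the function name `pooled_percent` is hypothetical. It reconstructs the frequencies from each percentage and its N, adds them, and only then forms the combined percentage.

```python
def pooled_percent(groups):
    """Combine percentages by returning to the underlying frequencies.

    groups: list of (percent, N) pairs, each percent computed on its own N.
    """
    hits = sum(pct * n / 100 for pct, n in groups)   # reconstructed frequencies
    total = sum(n for _, n in groups)
    return 100 * hits / total

# Hypothetical figures: 60 per cent of 25 cases and 40 per cent of 500 cases.
groups = [(60.0, 25), (40.0, 500)]

naive = sum(p for p, _ in groups) / len(groups)   # ignores the bases
pooled = pooled_percent(groups)                   # weights each by its N
```

The naive average is 50 per cent, whereas the pooled figure is about 41 per cent (215 cases out of 525), because the smaller group contributes only 25 of the 525 cases. Without the N of each distribution the second computation cannot be made at all.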

A variant of the foregoing tabulation is one that expresses cell frequencies 
as percentages of either row or column totals, usually referred to as marginal 
totals. We sometimes select those marginals representing the “causal,” 
or independent, variable. For example, the avowed objective of Table 
3.3.3 was to discover whether the opinion of the interviewer exerts an 
influence on the answer he obtains. Since in this instance the hypothetical 
causal factor (interviewer opinion) lies in the rows rather than the col- 
umns, frequencies are expressed as percentages of the row totals. 

Table 3.3.3  Respondent Attitude Toward Intervention, by Interviewer Attitudes
(Percentage Distribution)

Interviewer               Respondent Attitude
Attitude             "Keep Out"     "Help England"      Total

"Keep Out"               56               44             100%
"Help England"           40               60             100%

Source: Hadley Cantril, Gauging Public Opinion, Princeton University Press,
Princeton, N.J., 1944, p. 107 (adapted).


The data reveal that, to the "Keep Out" interviewers, a majority (56
per cent) reply similarly with the "Keep Out" slogan, whereas the "Help
England" interviewers elicit a majority (60 per cent) of "Help England"
responses. Apparently, the opinion of the interviewer exerts some influ-
ence on the response distribution, either because his opinion is unconsciously
contagious, or because he unwittingly misclassifies the ill-defined and
ambiguous responses in accordance with his own attitudes.
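Percentaging on the marginals of the independent variable can be sketched as follows in Python. Table 3.3.3 reports percentages only, so the cell counts below are hypothetical; they were chosen merely so that the row percentages agree with those of the table (56 and 44, 40 and 60). The function name `row_percents` is likewise an assumption.

```python
# Hypothetical counts: rows are interviewer attitudes (the supposed
# causal factor), columns are respondent attitudes.
table = {
    "Keep Out":     {"Keep Out": 140, "Help England": 110},
    "Help England": {"Keep Out": 60,  "Help England": 90},
}

def row_percents(table):
    """Express each cell as a percentage of its row total."""
    out = {}
    for row, cells in table.items():
        n = sum(cells.values())
        out[row] = {col: round(100 * k / n) for col, k in cells.items()}
    return out

pcts = row_percents(table)
```

Because each row is reduced to a base of 100, the two groups of interviewers can be compared directly even though, in this invented example, they conducted unequal numbers of interviews (250 and 150).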

An example of the cross-classification of two quantitative variables is
presented in Table 3.3.4, in which 1,848 marriages are distributed by age
of husband and by age of wife. In keeping with the purposes of cross-
classification, we examine the joint frequencies in order to determine
whether they form a discernible pattern. There is a fairly obvious trend:
husbands tend to be slightly older than their wives. For example, of the
719 husbands ages 20-24, 153 (21.3 per cent) are married to younger
women, 62 (8.6 per cent) to older; of the 850 wives ages 20-24, only 10
(1.2 per cent) are married to younger men but 336 (39.5 per cent) to older.


Table 3.3.1b  Suicide Rates by Magnitude and Region, Percentage Distribution

Suicide                           Region
Rate        N. East     South     N. Central     West      Total

 3- 5        12.5%       6.3%        0.0%         0.0%      5.6%
 6- 8        18.8       28.1        10.7          0.0      16.8
 9-11        34.4       28.1        32.1          0.0      27.1
12-14        25.0       21.9        28.6          6.7      22.4
15-17         3.1        9.4        25.0         13.3      12.2
18-20         6.2        3.1         0.0         26.7       6.6
21-23         0.0        0.0         3.6         20.0       3.7
24-26         0.0        3.1         0.0         20.0       3.7
27-29         0.0        0.0         0.0         13.3       1.9

Total       100.0%     100.0%      100.0%       100.0%    100.0%

Source: See Table 3.3.1a.


Table 3.3.2a  Marriages by Religion of Husband and Wife, 437 Married Couples, New Haven, Conn.

                            Wife
Husband        Catholic   Protestant   Jewish     Total

Catholic          271         20          0        291
Protestant         17         61          0         78
Jewish              1          1         66         68

Total             289         82         66        437

Source: A. B. Hollingshead, “Cultural Factors in the Selection of Marriage Mates,”
American Sociological Review, XV, 1950, p. 623.


Table 3.3.2b  Marriages by Religion of Husband and Wife (Percentage Distribution)

                            Wife
Husband        Catholic   Protestant   Jewish     Total

Catholic           62                               66
Protestant                    14                    19
Jewish              0          0         15

Total                         19         15        100%




(b) Does there appear to be a relationship between sex and occupation?
Explain.

Table 3.3.5  Major Occupation Group of Employed Persons by Sex, U.S., July 7-13, 1957

Major Occupation Group

Professional, technical, and kindred workers
Farmers and farm managers
Managers, officials, and proprietors, except farm
Clerical and kindred workers
Sales workers
Craftsmen, foremen, and kindred workers
Operatives and kindred workers
Private household workers
Service workers, except private household
Farm laborers and foremen
Laborers, except farm and

* Thousands of persons 14 years of age and over.

Source: U.S. Bureau of the Census, Current Population Reports, Series P-57, No. 181,
Government Printing Office, Washington, D.C., August, 1957.

Table 3.3.6  Type of Family Organization by Type of Economy, 101 Primitive Societies





Other entries may be similarly examined for their possible sociological
meaning.

Table 3.3.4  Age at Marriage, Husband and Wife, New Haven, Conn.

Source: A. B. Hollingshead, op. cit., p. 622.


The construction of a joint frequency table, such as those illustrated
above, incorporates no new principles; it is guided by the same considerations
that govern the univariate frequency table. The number and the
size of the class intervals should be fixed with a view to attaining as much
simplicity as possible, without at the same time distorting or blurring the
relationship between the two variables. With qualitative data, the categories
should be exhaustive and mutually exclusive. If such simple principles
are used as guides, the preparation of a cross-tabulation should
present no difficulties, and the final result should be both competent and
effective.


Questions and Problems 

1. Define the following concepts:

Cross-Classification
Cross-Tabulation
Univariate Distribution
Bivariate Distribution
Multivariate Distribution 
Joint Frequency 
Joint Frequency Distribution 
Independent Variable 
Dependent Variable 

2. Study Table 3.3.5. 

(a) List the important and conspicuous entries. 



University Press, Princeton, 1949, p- K9.) Arrange these results in a
4X2 table and attach an appropriate title. 

7. Cross-tabulate the paired variates of Table 3.3.8, using class intervals of $5.00
and one-half (.5) school year, and lower rounded limits of $15 and 7.5 school
years respectively. Each pair of values characterizes a given census tract.


Table 3.3.8  Median Years of School Completed by Median Monthly Rentals, 115 Census Tracts, Indianapolis, 1950



8. Older people take more interest in political elections than younger persons, as
suggested by Table 3.3.9. The difference is especially marked among those



3. (a) In Table 3.3.6, which variable (family organization or basic economy)
would you consider independent?

(b) Express cell frequencies as percentages of row totals. Does this enhance
the effectiveness of the table? Explain.

4. (a) In Table 3.3.7, which variable would you consider independent?

(b) How would you explain the small number of “Unhappy” ratings by both
husbands and wives?

(c) What percentage of husbands and wives agree in their ratings?

Table 3.3.7  Marital Happiness Ratings of Husband and Wife, 252 Married Couples

                                Husband's Rating
Wife's              Very                                      Very               Per
Rating             Unhappy   Unhappy   Average    Happy      Happy     Total     Cent

Very Happy             1         0         3        24        112       140     55.6%
Happy                  0         0        12        38         12        62     24.6
Average                0         3        14         7          6        30     11.9
Unhappy                1        11         2         0          0        14      5.5
Very Unhappy           5         1         0         0          0         6      2.4

Total                  7        15        31        69        130       252
Per Cent             2.8%      5.9      12.3      27.4       51.6

Source: Ernest W. Burgess and Leonard S. Cottrell, “The Prediction of Adjustment in
Marriage,” American Sociological Review, I, 1936, p. 741.


5. A club has 100 members. Among them are 50 lawyers and 50 liars. The
number of members who are neither lawyers nor liars is 20. Construct a
2X2 table to show the number of lawyers who are liars, the number of lawyers
who are not liars, the number of liars who are not lawyers, and the number
who are neither.

6. During World War II, white commissioned and noncommissioned officers were
asked to compare the effectiveness of colored troops with that of white troops.
The classification of answers was as follows:

(a) Not as good as white
(b) Same as white
(c) Better than white
(d) No answer

Of the commissioned officers (N = 60), 5 per cent gave the first reply listed,
69 per cent the second, 17 per cent the third, and 9 per cent the fourth. Of
the noncommissioned officers (N = ), the distribution was 4 per cent, 83 per
cent, 9 per cent, and 4 per cent. (Source: Samuel A. Stouffer et al., The American
Soldier, Vol. 1 of Studies in Social Psychology in World War II, Princeton




of social change, are as significant to the sociologist as is the description
of its static aspects. Not only are we interested in the momentary size of
the American population, but also in its growth or decline. The social
analyst does not restrict his attention to the divorce rate in a given year,
but necessarily also examines its fluctuations from year to year. Social
trends are as revealing of the laws of social behavior as are the cross-sections
of social life.

The time series (Table 3.4.1) is one of the simpler devices for the portrayal
of cultural and social changes; it is a set of values systematically
arranged in chronological order, and is thereby distinguished from a mere
chronicle of events. Nevertheless, the construction of a time series is
governed by technical considerations analogous to those entering into the
preparation of the frequency distribution: (1) the intervals should be of
such length as to depict the essential trend or pattern without needless
detail; (2) they should be expressed in familiar units of time and their
multiples, such as the month or year; and (3) they should be uniform in
size and, when possible and fitting, evenly spaced throughout the series.
Furthermore, the variable should have a constant definition for the entire
period covered by the series, and the boundaries of the geographic area in
which the events occur should also remain fixed throughout. These rules
will be more difficult to enforce as the time series increases in duration,
owing to the malleability of social circumstance.


Table 3.4.1  Number of Divorces per 100 Marriages, U.S., Selected Years, 1870-1955


The Time Interval. There is, of course, no standard time interval for all
occasions; rather the interval will vary according to the requirements of
the investigation. Thus, the purpose may be to determine the hourly



having more education, but it is also present among those with less education.

(a) Condense Table 3.3.9 to show the percentage distribution of interest in
elections by education, irrespective of age. (Suggestion: convert given
percentages into absolute frequencies, recombine as necessary, and convert
to percentages.)

(b) What information is “lost” in the condensation?

Table 3.3.9  Interest in Political Elections, by Education and Age

                                     Education
Degree of            No High School              Some High School or More
Interest         Under 45 Yrs.   45 or Older    Under 45 Yrs.   45 or Older

Great                19%                             26%
Little               81                              74

Total               100%           100%             100%           100%
(N)                                                 1,174

Source: P. Lazarsfeld, B. Berelson, and H. Gaudet, The People's Choice, Columbia University
Press, New York, 1948, p. 41.


9. In Table 3.3.2a, what per cent

(a) of Catholic husbands marry Catholic wives?

(b) of Catholic wives marry Catholic husbands?

(c) of Protestant husbands marry Catholic wives?

(d) of Catholic husbands marry Protestant wives?

What light do these figures throw on the tendency of men and women to cross
the religious curtain?

10. (a) Of husbands 25-29, calculate the percentage married to younger, to older,
and to women of the same age (Table 3.3.4).

(b) Of wives 25-29, calculate the percentage married to younger, to older,
and to men of the same age.

(c) Comment on social significance.


Section Four
Time Series

The Nature of the Time Series. Up to this point in the discussion, we have
considered the variable only at a given instant and have ignored its presumable
changes in magnitude in successive intervals of time. However,
the dynamics of social organization, which manifest themselves in patterns

but the changing content of the variable as well. For example, a shift in
the legal definition of juvenile delinquency to include seventeen-year-old
boys, who had previously been excluded, would produce an apparent increase
in delinquents, and create a false impression of moral decay. Needless
to state, a careful worker would endeavor to adjust his data to compensate
for such revisions in definition.

Similarly, the effects of improvements in diagnostic procedures on sta- 
tistical fluctuations in disease rates may be confused with the effects of 
the increasing average age of the population, unless appropriate corrections 
are made. This is well illustrated in the field of cancer, where classificatory
changes make it difficult to discern whether the trends in the cancer rate
are real or apparent (Table 4.4.1). The prescription of a fixed variable is
especially difficult to follow in the social realm, since the definitions of 
many of our social variables are undergoing almost constant revision. The 
definition of delinquency is much less stable than that of a chemical. 

Constancy of Area. It is equally desirable that the time series be based on
a relatively fixed geographical area; otherwise the sequence of measures 
may merely reflect alterations in area boundaries. Twentieth century 
demographers have actually encountered such difficulties in measuring 
the population growth of European nations because their boundaries have 
been so frequently and drastically redrawn. 

Similarly, between 1915 and 1933 in the United States, there was con- 
siderable unreliability in the calculated trend of the national birth rate 
because of the periodic addition of states to the birth registration area 
(Table 3.4.3). In 1915 the rate was based on the returns from only ten
states; not until 1933 were all states included. Subsequently, national 
rates have been prepared, incorporating natality estimates for the non- 
registration states, an adjustment made necessary by the lack of uni- 
formity in the registration area. All yearly rates are normed to the 
same base area; hence, we call this a norming operation. 

Measures of Change: Relative and Absolute. A time series is a succession
of chronologically spaced observations, and is designed to depict growth,
decline, or simply variations in the incidence of the things observed. These
quantitative observations may be in the form of absolute values or relative
values. For example, the rise in divorce may be measured as absolute
increase in the number of divorces, or as a rise in the rate of divorces per
100 marriages. These changes are not necessarily in the same direction; 
nor do they even supply identically useful information on questions which 
the sociologist may raise. If this fact is not appreciated, the worker will 
often come up with inappropriate answers. 

Table 3.4.4 displays the basic observations on which we may compute
the growth rate in the population 65 years of age and older. There are



variation in telephone calls or automobile traffic, or the daily variation in
school attendance, or in industrial absenteeism. In studies of seasonal
variation (Table 3.4.2), the month is a convenient unit, whereas the year
or decade may be an appropriate interval to depict a long-term trend, such
as the growth of the nation’s population. Very long term trends in which
fluctuations are smoothed out are sometimes known as secular trends.


Table 3.4.2  Number of Marriages and Marriage Rates, by Month, U.S., 1956

Month            Number        Rate*

January          101,000        7.1
February          99,000        7.5
March            102,000        7.2
April            117,000        8.5
May              128,000        9.0
June             201,000       14.7
July             128,000        9.0
August           150,000
September        145,000
October          130,000
November         120,000
December         133,000        9.4

Total          1,560,000        9.4

* Rate per 1,000 population, adjusted for length of month.

Source: Office of Vital Statistics, Vital Statistics - Special Reports, Marriages and
Divorces, 1956, U.S. Government Printing Office, Washington, D.C., 1958.


comparable and intelligible.
preponderance of June

fluctuations. It is obvious





Table 3.4.4  Persons 65 Years and Over, U.S. Population, Selected Years, 1900-1940

              Total             Persons 65+        Percentage
Year        Population        (in Thousands)        of Total
          (in Thousands)                           Population

1900          76,094              3,100               4.1
1905          83,820              3,504
1910          92,407              3,985               4.3
1915         100,519              4,501
1920         106,466              4,929
1925         115,832              5,788               5.0
1930         123,077              6,706
1935         127,250              7,803               6.1
1940         131,951              9,031               6.8

Source: Bureau of the Census, Current Population Reports, Population Estimates,
Series P-25, No. 114, U.S. Government Printing Office, Washington, D.C.,
April 27, 1955.


of old people in 1900 (3,000,000), and arrive at an increase of 200 per cent.
Both of these measures are relevant when our interest is restricted to the
aged population. For example, the manufacturer of hearing aids will be
interested in the continued absolute growth of the older population, which
constitutes for him an expanded market; he sells hearing aids to people,
not to proportions. On the other hand, our interest may lie in the changing
proportion of older people in the total population. National welfare
administrators, for example, will want to know something about the
changing ratio of dependent to productive persons. An insurance actuary
will be concerned with the increasing proportion of the aged, which
is indicative of a longer average length of life, for it is on this average
that premiums are computed; he is more concerned with probabilities
than with raw numbers. In such cases our interest shifts to Measures
3 and 4. (3) Percentage point increase. We see that the aged constituted
4.1 per cent of the total United States population in 1900, and
6.8 per cent of the population in 1940, an increase of 2.7 percentage
points. (4) Per cent increase in the proportion. To find this figure, we divide
the percentage-point increase (2.7) by the percentage of old people in
1900 (4.1), and find that there has been a 66 per cent increase in the proportion
of the total population 65 or over.
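
The four measures can be checked with a short computation. The sketch below is illustrative only: the function name `change_measures` and its dictionary keys are hypothetical, and the inputs are the rounded figures used in the text (3,000,000 and 9,000,000 persons; 4.1 and 6.8 per cent).

```python
def change_measures(old_count, new_count, old_share, new_share):
    """Return the four measures of change discussed in the text.

    Counts are numbers of persons; shares are percentages of the
    total population.
    """
    absolute_increase = new_count - old_count
    point_increase = new_share - old_share
    return {
        "absolute_increase": absolute_increase,
        "percent_increase": 100.0 * absolute_increase / old_count,
        "percentage_point_increase": point_increase,
        "percent_increase_in_proportion": 100.0 * point_increase / old_share,
    }


# Rounded 1900 and 1940 figures for persons 65 and over.
measures = change_measures(3_000_000, 9_000_000, 4.1, 6.8)
for name, value in measures.items():
    print(f"{name}: {value:,.1f}")
```

The first two measures describe the aged population taken by itself; the last two describe its standing relative to the total population, which is why the two pairs can move in different directions.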

In certain instances, the absolute and relative measures may show opposite
trends. Thus, between 1790 and 1930, the Negro in America increased
from 700,000 to 15,000,000; but relative to the United States population,
he declined from 19.3 to 10 per cent. The choice of measure, therefore,
must always be adapted to the substantive issue.












Table 3.4.3  Crude Birth Rates, Area and U.S., 1915-1935

Year    Number of States in Registration Area    Birth Rate in Registration Area    U.S. Birth Rate*



available to us at least four different measurements, each of which is quite
legitimate for some purposes, but no single one of which can supply us with
the sociologically relevant information in all situations. We may measure:
(1) the absolute increase of persons 65 and over; (2) the percentage increase
in the absolute growth of the population of persons 65 or over; (3) the
absolute percentage point increase in the proportion of old people in the
total population; and (4) the percentage increase in the proportion of old
people in the total population.

Let us compare the years 1900 and 1940 in Table 3.4.4 to see what these
four different measures mean. (1) The absolute increase. In 1900, there
were about 3,000,000 persons 65 or older in the United States, and in 1940,
there were about 9,000,000, an absolute increase of 6,000,000. (2) The
percentage increase in the absolute growth. To find the percentage increase,
we divide the absolute increase of old people (6,000,000) by the number








Table 3.4.6  Divorce Rate and Percentage of Married Women Gainfully Employed, Selected Years, U.S., 1890-1950

* Compiled from bulletins of the Women's Bureau, U.S. Department of Labor,
U.S. Government Printing Office, Washington, D.C.

Year    Divorce Rate    Percentage of Women Employed*

1890
1900
1910
1920
1930
1940
1945
1950


13.4 

17.4 

16.5 


25.6 

24.8 


ever, that would be a premature generalization, since it would be possible
that the two trends are actually a common consequence of one or more
hidden factors, or it may be sheer coincidence.

In any time series of significant duration, the observed trends are always
affected by innumerable, uncontrolled, and unidentified factors which
make simple interpretations a precarious venture. Nevertheless, time
series, when carefully constructed by keeping potentially disturbing factors
constant, supply objective comprehensible measures of social change without
which certain types of social analysis would be impossible.

Questions and Problems

1. Define the following concepts:

Time Series
Seasonal Variation
Secular Trend
Absolute Change
Relative Change
Cumulated Time Series

2. Distinguish between a time series and historical chronology.

3. Cite three social variables (not given in text) subject to change of definition so
that it would be hazardous to express their variation in a time series of significant
duration.

4. To what extent does the correspondence between the changes in the divorce
and employment rates establish a causal relation (Table 3.4.6)?

5. (a) Suggest a sociological interpretation of the seasonal variation in marriages
(Table 3.4.2).

(b) Account for the peak rate in June, the trough rate in January.

6. (a) February has fewer marriages than March but shows a higher rate (Table
3.4.2). Explain this seeming contradiction.




Cumulated Time Series. Cumulation is nothing more or less than the act
of joining contiguous intervals and adding their frequencies. Under certain
circumstances, the time series is subject to cumulation like a simple frequency
distribution. Such is the case when the separate items constitute
a meaningful total frequency, distributed throughout the period, and
which are cumulatively accounted for at the close of that period. An
example is provided by the rate of diffusion of the postage stamp among
the countries of Europe and North and South America (Table 3.4.5).
Prior to 1840, only one country had adopted this device; by mid-century,
nine countries were using this method; however, not until another thirty
years had elapsed had all of the 37 independent countries introduced the
postage stamp. Most countries waited an average length of time to
adopt the postage stamp. Such a simple technique of tabulation may
often be effectively utilized in a preliminary manner to reveal the course
and tempo of the diffusion of cultural traits or the flow of information.
In fact, such tabulations have been the occasion for the formulation of
“universal” laws of growth and diffusion.
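
Cumulation amounts to a running sum over the class frequencies. In the sketch below, the five-year intervals follow Table 3.4.5, but the individual frequencies are hypothetical: they are chosen only to agree with the three figures quoted above (one country by 1840, nine by 1850, and 37 by 1880).

```python
from itertools import accumulate

intervals = [
    "1836-40", "1841-45", "1846-50", "1851-55", "1856-60",
    "1861-65", "1866-70", "1871-75", "1876-80",
]
# Hypothetical number of countries first issuing a stamp in each interval.
adoptions = [1, 3, 5, 9, 7, 5, 4, 2, 1]

# Running total of countries that have adopted the stamp by the end of each interval.
cumulative = list(accumulate(adoptions))

for interval, f, cum in zip(intervals, adoptions, cumulative):
    print(f"{interval}  {f:>3}  {cum:>3}")
```

The last cumulative entry always equals the total frequency, which is what makes the cumulated series meaningful at the close of the period.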


Table 3.4.5  Date of First Postage Stamp Issue, Selected Countries, 1836-1880

Date         Number

1836-40
1841-45
1846-50
1851-55
1856-60
1861-65
1866-70
1871-75
1876-80

Source: H. Earl Pemberton, “The Curve of Culture Diffusion Rate,” American
Sociological Review, I, 1936, p. 552.


Concurrent Trends. Two or more time series may be juxtaposed for the
purpose of exhibiting the relationship between them. This is an elementary
but useful procedure which may direct preliminary attention to
a possible statistical association, which may then be measured by more
advanced methods.

We illustrate such a comparison by setting up the employment rate
of married women alongside the divorce rate in parallel tabulation. There
is a startling similarity in the rates of growth of the two series. One might
therefore be tempted to conclude that the increasing divorce rate is functionally
related to the increased employment of married women. How-



Graphic Presentation 


Section One 

The Histogram and the Frequency Polygon 


Function of the Graph. All tabular material can be converted into graphic
forms, which are usually more effective in conveying an impression of the
pattern of the total distribution than is the more intricate frequency table.
Being the areal equivalent of frequency, the graph gives visibility to the
distribution, and consequently is more readily suggestive of its meaning
and interpretation. The principal function of the graph is to convey an
accurate conception of the shape of the frequency distribution, a conception
which not even the skilled expert, much less the layman, could
construct mentally from the raw tabular data. We cannot expect the
table to do what the graph can do better.

There are many types of graphic representation, each useful in its own
way when properly applied. Some of them are very ingenious and elaborate,
but only the basic and simplest types will be here discussed: (1) the
histogram, (2) the frequency polygon, (3) the cumulative frequency polygon,
(4) the bar chart, (5) the statistical map, and (6) the time chart. Of
these six types, the first three are applicable only to quantitative data.
The bar chart is designed primarily for qualitative data; the statistical
map displays items classified by geographical area; and the time chart is
the graphic version of the time series.

Since graphs are the equivalents of tables, they should carry analogous
identifying titles and markers. They are subject to the same criteria of
intelligibility, simplicity, and clarity. In fact, a graph cannot be planned
or constructed until the corresponding table has been prepared.

Construction of the Histogram. The histogram consists of a set of contiguous
columns whose heights are proportional to class frequencies, and whose


(b) October and November have identical rates but differ in the number of
marriages. Explain.

7. Is the ratio of divorces to marriages a reliable measure of incidence of marriage
instability in such states as Nevada?

8. In how many ways is it possible to measure changes in volume of population
under 21 years of age? Measure these changes in the U.S. population between
1900 and 1900. 

9. Account for the pattern of diffusion of the postage stamp in the period 1840-
1870 (Table 3.4.5).



by the number of class intervals to be plotted. The result will be the
number of linear units allotted to each interval. Beginning at the intersection
of the two axes, we can now lay down the class intervals and mark
their boundaries with tiny upright lines, or ticks, appropriately placed
inside the axes, since they may be construed as vestigial stubs of the
original grid lines. Because these ticks represent the contact points of
the contiguous intervals, they are designated by markers carrying the true
values of the class limits. Such true limits will necessarily be in fractional
form when the values have been rounded to the nearest whole. Thus, in
Figure 4.1.1, class limits are designated as 2.5, 5.5, and so forth. When,
however, values have been rounded to the last whole, as in the case of
United States Census age data, the true boundaries may be simply written
as integers: 0, 5, 10. In the interest of clarity, markers may be uniformly
spaced at selected class boundaries or placed at selected midpoints. By
these discretionary procedures, the cluttered effect of crowded markers
may be avoided.

The frequency scale is established on the vertical axis in an analogous
manner. We take the largest class frequency to be accommodated in the
graph, and divide it by the number of linear units available on the previously
drawn axis. We thereby determine the number of items to be assigned
to each unit on the upright axis. Thus, in Figure 4.1.1, two items
correspond to one unit. Beginning with the zero origin at the intersection,
markers are now spaced at equal intervals, and given in such multiples
(5, 10, and so on) as may be clear and intelligible.
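
The arithmetic for laying off the two scales can be sketched as follows. The function names and the axis lengths are hypothetical, and the sketch assumes that the vacant interval at either end of the base line is counted among the intervals to be plotted.

```python
def units_per_interval(axis_units, n_intervals, vacant_each_end=1):
    """Linear units allotted to each class interval on the base line."""
    return axis_units / (n_intervals + 2 * vacant_each_end)


def items_per_unit(largest_frequency, axis_units):
    """Number of items represented by one linear unit on the vertical axis."""
    return largest_frequency / axis_units


# Hypothetical base line of 33 units carrying 9 class intervals,
# with one vacant interval left at each end.
print(units_per_interval(33, 9))

# Hypothetical vertical axis of 15 units accommodating a largest
# class frequency of 30.
print(items_per_unit(30, 15))
```

In practice the quotients seldom come out even, and one would round to a convenient value (up for the frequency scale, down for the interval width) so that the whole distribution still fits on the page.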

Having marked off the two scales, we are now ready to draw the columns
of the histogram within the frame of the axes. The vertical outlines of
the columns are erected on the points of the true boundaries, and their
heights are determined by the frequencies of the respective class intervals.

Of the aforementioned directives, the rule that the frequency scale originate
with zero is inflexible; otherwise the required proportionality among
column areas could not be maintained. A violation of this rule is illustrated
in Figure 4.1.2a, where the bottom half of the histogram has been amputated.
The ratio of the first two class frequencies is 20 to 25, or 4 to 5.
However, in Figure 4.1.2a the ratio between the areas of the first two col-


Figure 4.1.2a  Histogram (Incorrect) Without Zero Origin





widths are proportional to the size of the class intervals of the variable.
A histogram based on the frequency distribution of 107 suicide rates
presented in the last chapter (Table 3.1.1) is shown in Figure 4.1.1. It is



Figure 4.1.1  Histogram of Suicide Rates, 107 Large U.S. Cities, 1950


not only a graphic record of the absolute class frequencies; it also mirrors
the size of each frequency relative to all others.

The histogram is constructed on simple arithmetic graph paper, which
is ruled with equally spaced horizontal and vertical guide lines. These
intersecting guide lines are thereby divided into equal linear units which
are a necessary base for systematic plotting. The first step in the procedure
is to draw at right angles on selected grid lines two axes of approximately
equal length, with their intersection near the lower left-hand
corner of the page. Class boundaries are plotted on the horizontal axis,
and class frequencies are plotted on the vertical axis.

However, before laying off the class intervals on the base line, we must 
fix the appropriate number of linear units to be assigned to each class 
interval of the table. It is often good practice to allow for a vacant inter- 
val at either end of the horizontal scale, which improves the aesthetic 
appearance of the graph and promotes readability. We accordingly count 
the total number of units along the length of the axis, and divide that total



Figure 4.1.3a  Histogram (Incorrect), Extended Horizontal Axis


Figure 4.1.3b  Histogram (Incorrect), Extended Vertical Axis





umns is 1 to 2, instead of 4 to 5, so that the areal ratio fails to correspond 
to the frequency ratio. The areas of the other columns in 4.1.2a are equally 
distorted. The correct ratios are displayed in Figure 4.1.2b. 
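
The distortion can be verified arithmetically. The sketch below assumes columns of equal width, so that areas are proportional to drawn heights, and it assumes that the truncated scale of Figure 4.1.2a begins at 15; that starting value is not stated in the text but is the one that turns the ratio 20 to 25 into 1 to 2. The function name is hypothetical.

```python
from fractions import Fraction


def column_area_ratio(freq_a, freq_b, scale_origin=0):
    """Ratio of the drawn areas of two equal-width columns.

    Only the part of each column above the scale origin is drawn.
    """
    return Fraction(freq_a - scale_origin, freq_b - scale_origin)


print(column_area_ratio(20, 25))                   # 4/5
print(column_area_ratio(20, 25, scale_origin=15))  # 1/2
```

Only with the origin at zero does the ratio of areas equal the ratio of frequencies.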



Figure 4.1.2b  Histogram (Correct) With Zero Origin


However, the rule that the two axes be of approximately the same length
is not inflexible; it is simply considered good practice. Unless there is
an obviously good reason for resorting to unequal axes, the resulting graph
is likely to produce a distorted impression. By extending the base line
and shortening the vertical axis, the histogram may be made to appear
long and flat, thereby conveying an impression of great variation. On the
other hand, by lengthening the vertical axis and contracting the base line,
we produce a tall narrow figure, which leaves the impression of only slight
variation. These effects are illustrated in Figures 4.1.3a and 4.1.3b, which
portray the same table.

Unequal Class Intervals. In Table 4.1.1, the last interval is twice the width
of the other intervals. Its frequency of four is therefore not comparable 


Table 4.1.1  Frequency Distribution, Unequal Class Intervals

 0- 4
 5- 9
10-14
15-19
20-29

Total

Source: Hypothetical






Figure 4.1.4b  Histogram (Incorrect), Class Intervals


We can now appreciate the practicability of the rule set forth in Chapter 3
that class intervals be of equal width; or, failing this, that unequal
class intervals be convenient multiples of smaller ones. Similarly, the
caution against open-ended tables has now acquired new practical meaning.
There is no way to adjust the frequency of a class interval of unknown size,
since an indefinite interval cannot be expressed as a multiple of a closed
interval. It is therefore impossible to represent an open-ended table by
the histogram, unless one arbitrarily closes the interval, a measure to
which we sometimes may resort.

The distinguishing feature of the histogram is its schematic simplicity.
Columns are more easily apprehended than numbers. It clearly reveals
the relative concentration of items in each interval, and shows the contour
of the distribution.




to the frequencies of the smaBer intervals. To establish comparability, 
and preserve spatial ratios, it is therefore necessary to break up the last 
interval into two intervals of equal width; and we must break up its 
frequency correspondingly into two equal frequencies of two each. These 
adjusted sub-frequencies are then plotted (Figure 4.1.4a). Had the un- 
adjusted frequency of four been plotted (Figure 4.1.4b), the frequency 
column on the estreme right would have enclosed twice as large an area 
as was its due, and thereby created an impresaon contrary to fact. 

In general, before graphing a table in which class intervals are unequal,
all larger intervals must be expressed as multiples of the smallest interval,
and the corresponding frequencies divided by these multiples.
This latter step will yield the adjusted frequencies, which are then plotted.
This procedure is in accord with the principle that each item in the frequency
distribution be represented by an equal area on the graph, so that
relative frequencies be proportionately presented in area form.
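
The adjustment just described may be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `adjusted_frequencies` and the input layout are our own, and every frequency except the final four is invented for the example.

```python
def adjusted_frequencies(boundaries, frequencies):
    """Scale each class frequency to the width of the smallest interval.

    boundaries  -- list of (lower, upper) class boundaries (assumed layout)
    frequencies -- list of observed class frequencies, one per interval
    """
    widths = [upper - lower for lower, upper in boundaries]
    smallest = min(widths)
    # Express each interval as a multiple of the smallest interval,
    # then divide the class frequency by that multiple.
    return [f / (w / smallest) for f, w in zip(frequencies, widths)]


# Intervals patterned on Table 4.1.1; all frequencies except the last (4)
# are hypothetical.
boundaries = [(0, 5), (5, 10), (10, 15), (15, 20), (20, 30)]
frequencies = [3, 6, 8, 5, 4]
print(adjusted_frequencies(boundaries, frequencies))
```

The last interval is twice as wide as the others, so its frequency of four is plotted at a height of two, while the remaining frequencies are unchanged.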





off on the base line, and the frequency scale along the vertical axis. At
this stage, however, the procedures diverge, in that points and connecting
lines are plotted instead of columns. The points are plotted over class
midpoints at heights called for by the respective class frequencies, and then
joined by straight lines to form the polygon.
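
The plotting positions of the polygon can be computed directly from the class boundaries. The sketch below is illustrative only: the function name `polygon_points` is hypothetical, and the boundaries and frequencies are invented.

```python
def polygon_points(boundaries, frequencies):
    """Return (midpoint, frequency) pairs for a frequency polygon.

    boundaries  -- ordered class boundaries; one more entry than frequencies
    frequencies -- class frequencies, one per interval
    """
    if len(boundaries) != len(frequencies) + 1:
        raise ValueError("need exactly one more boundary than frequencies")
    # Each point sits over the midpoint of its class interval.
    midpoints = [(lo + hi) / 2 for lo, hi in zip(boundaries, boundaries[1:])]
    return list(zip(midpoints, frequencies))


# Hypothetical distribution with three intervals of width five.
points = polygon_points([0, 5, 10, 15], [2, 7, 3])
print(points)
```

Joining the returned points in order with straight lines gives the polygon; the columns of the histogram would instead be erected over the full width of each interval.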

The frequency polygon need not be extended to the base line beyond the
range of actual observation. In fact, in a U-shaped distribution, in which
the concentration of cases increases at either end, it would even be illogical
to do so. It would appear, therefore, sound policy to leave the polygon
open at either end, and thereby avoid the hazard of misrepresenting the
frequencies where none were observed. In any event, the general contour
of the bulk of cases is in no way affected by open ends, and it is this shape,
after all, which the polygon seeks to portray.

When a closed figure is contemplated, the usual procedure is to extend
the graph to the base line at the midpoints of the vacant intervals on either
end (Figure 4.1.6b). The rationale of this practice is to preserve the area
enclosed by the histogram, which symbolizes the total frequency,
by setting up pairs of congruent and compensating triangles. While
this is reasonable enough, it should not be overlooked that the polygon


all practical purposes only one would ever be constructed. In constructing
a "less than" CFP (Figure 4.2.1a), the cumulated frequencies are plotted

Figure 4.2.1a "Less Than" Cumulative Polygon, Age Distribution,
U.S., 1950

over true upper boundaries of class intervals; whereas in constructing an
"or more" graph, the frequencies are plotted over the true lower boundaries
(Figure 4.2.1b). This procedure differs from that of the simple

families are less than size 3, and 45 per cent are more than 3, then families of size 3
remain unaccounted for.

When the data are continuous, however, this difficulty does not arise, since the cutting
point does not occupy any part of the continuum. Thus, the statement that in 1950,
36 per cent of the United States population was less than 21 years old, and 64 per cent
were more than 21, would be precise only if 21.0 is conceived of as the contact point
(a necessary statistical convention) between the intervals 20 and 21, which are
each one year in width. The problem arises when 21 is treated not as a point, but as
an interval of one year, which would be the spontaneous lay interpretation.

The designation employed in this text is sufficiently versatile to satisfy the requirements
of the discrete data, and do no violence to the continuous data.



The Nature and Construction of the Frequency Polygon. When items are
compressed into a relatively small number of broad intervals, the resulting
class frequencies tend to jump in an abrupt manner. It is reasonable to
suppose, however, that the progression of class frequencies would be considerably
smoother if we employed many relatively small class intervals.
It is the function of the frequency polygon to provide an approximation of
the smooth curve that would presumably emerge if class intervals were
made as small as possible and the number of observations were unlimited.

The frequency polygon may be derived from the histogram by simply
connecting with straight lines the midpoints of adjacent column tops.
Such a conversion is illustrated in Figure 4.1.5a, where a frequency polygon

Figure 4.1.5a Histogram and Frequency Polygon, Suicide Rates,
Large U.S. Cities, 1950

is superimposed on the histogram of suicide rates. In practice, only one
graph or the other would be presented, depending on whether the emphasis
is to be placed on the tabulated class frequencies or on the hypothetical
point frequencies, which the frequency polygon provides.

It should be evident that the procedure in constructing the frequency
polygon will parallel that of the histogram in all but the final stages. First,
two axes are drawn on the graph paper. Class intervals are then marked

(b) Combine the last four intervals into two intervals of equal width and show
by a heavy line how the appearance of the histogram would be altered.

11. (a) Plot the frequency tables of American and National League 1957 batting
averages (Problem 3.1.16) as histograms.

(b) For purposes of comparison, should the histograms be superimposed or
plotted separately?

12. (a) Prepare a histogram for the distribution of U.S. families according to size,
placing markers at midpoints instead of true class boundaries.

(b) What are the true class boundaries of the intervals?

(c) Justify your choice of an upper boundary to close the graph.

Size of Family     Per Cent

2                   32.5%
3                   22.8
4                   20.8
5                   12.3
6                    5.9
7 or more            5.7

                   100.0%

13. (a) Draw histograms of the frequency distributions of Indianapolis census
tract rentals and educational levels (Problem 3.3.7).

(b) Compare the shape of the two distributions and give a possible interpretation.

14. Construct two frequency distributions: suicide rates (a) east of the Mississippi,
and (b) west of the Mississippi (Table 3.1.1a). Convert frequencies to percentages.
Plot percentage distributions as frequency polygons on one set of
axes. Compare and interpret.


Section Two 

The Cumulative Frequency Polygon or Ogive

The cumulative frequency table (Table 4.2.1) gives the percentage of
cases below (or above) each given class boundary, but not the cumulated
frequencies at the innumerable intermediate values within the intervals.
Nor can it therefore readily provide the values that correspond to
all possible cumulated percentages. Yet information on such intermediate
values is frequently required. We may require, for example, the percentage
of the population under 21 years of age; or the age such that 50 per cent
of the population is younger and 50 per cent older. Although this sort of
information could be obtained from the cumulative table by arithmetic
interpolation between the class limits, it can be obtained with less effort
and sufficient




inevitably disturbs the proportionality between column areas and class
frequencies. It will always underrepresent the proportion of items in
the modal interval.

We should therefore remind ourselves again that no statistical device
ever perfectly represents the data on which it plays. The frequency polygon
pretends to do no more than provide quickly and economically a rough
first approximation of the theoretical frequency curve of values grouped
into the smallest possible intervals. For the purpose of comparison, frequency
polygons representing different distributions of the same variable
may be mutually superimposed without the blurring of their respective
outlines that would occur with histograms.

Questions and Problems

1. Define the following concepts:

Histogram
Frequency Polygon
Horizontal Axis
Vertical Axis
Frequency Scale
Variable Scale
Zero Origin
Arithmetic Grid Paper

2. How is the appearance of the histogram changed when the horizontal scale is
enlarged in relation to the vertical? The vertical in relation to the horizontal?

3. What is the advantage of plotting either relative frequencies or absolute
frequencies? Is the appearance of the histogram altered when plotting relative
frequencies instead of absolute?

4. When is the frequency polygon appropriate for discrete data? (Consider
its appropriateness for the smallest possible interval of one and for intervals
of increasing width.)

5. State the principles governing the closing of the polygon.

6. If you wished to compare two distributions by superimposing the graph of
one on the other, which type would you select: histogram or frequency polygon?
Explain.

7. Is the rule that the frequency scale originate with zero as essential in the
construction of the polygon as it is for the histogram? Explain.

8. Which figure (histogram or polygon) adheres more closely to the frequency
table?

9. Can the polygon be converted as readily into the histogram as the reverse?

10. (a) Represent the frequency distribution of delinquency rates (Table 3.1.7)
by a histogram.




Table 4.2.1  Cumulative Age Distribution, U.S., 1950

Age*            "Less Than"                 "Or More"
                Cumulative Per Cent         Cumulative Per Cent

 0- 4            10.7 (less than 5)         100.0 (0 or more)
 5- 9            19.5 (less than 10)         89.3 (5 or more)
10-14            26.9                        80.5
15-19            33.9                        73.1
20-24            41.5                        66.1
25-29            49.6                        58.5
30-34            57.2                        50.4
35-39            64.7                        42.8
40-44            71.5                        35.3
45-49            77.5                        28.5
50-54            83.0                        22.5
55-59            87.8                        17.0
60-64            91.8                        12.2
65-69            95.1                         8.2
70-74            97.4                         4.9
75 and over     100.0 (less than 100)         2.6 (75 or more)

* Age rounded to the last whole year.

Source: U.S. Bureau of the Census, U.S. Census of the Population: 1950, Vol. II,
Characteristics of the Population, Part 1, United States Summary, U.S. Government
Printing Office, Washington, D.C., 1953.


accuracy from the cumulative frequency polygon (CFP), or ogive, as it is
sometimes called. "Ogive" is an architectural term for the diagonally
curved rib of the Gothic vault, which the cumulative frequency curve often
resembles; hence, it was picturesquely so named in 1875 by the English
statistician, Francis Galton.

Construction of the CFP. The construction begins with plotting axes as for
the simple frequency polygon: class intervals are laid off on the base line,
and frequencies along the vertical axis. Since frequencies are now cumulative
rather than simple, the range of the vertical scale will be equal to
the total frequency. For quicker comprehension and ready comparability,
the cumulative frequencies are usually expressed as percentages, so that the
frequency scale extends from 0 to 100.

Corresponding to the two types of cumulative frequency tabulations,
the "less than" and the "or more,"* there are two types of cumulative
polygons. But the two graphs provide identical information, so that for

* Some writers employ the terminology "less than" and "more than"; for example,
"less than 3" and "more than 3." However, these terms create difficulties when applied
to discrete data unless they are treated as continuous. If 33 per cent of all




order to close the tabulation of family incomes (see Table 4.2.2), which
reads "$15,000 and over," we would have to add literally hundreds of class
intervals since the maximum income is in excess of a million dollars. The
corresponding CFP would be a useless monstrosity.

Applying CFP. The CFP permits the frequency distribution to be partitioned
at any point whatsoever, according to one's interests and needs.
For example, the point that separates the lower 25 per cent and the upper
75 per cent of the items, the so-called first quartile (Q1), may be readily
obtained in the following manner (Figure 4.2.1c): from the 25 per cent
marker on the frequency axis, run a line parallel to the base line until it
intersects the "less than" ogive; from this intersection drop a perpendicu-



Figure 4.2.1c "Less Than" Cumulative Polygon, Age Distribution,
Selected Centiles and Quartiles



frequency polygon where class midpoints are plotted. Owing to the
manner in which scales are laid off, a "less than" CFP will begin in the
lower left-hand corner of the graph and move diagonally across to the upper
right-hand corner; while the "or more" CFP will begin in the upper left-hand
corner and move diagonally across to the lower right-hand corner.
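
The two sets of plotting points can be generated together. The following is a minimal sketch under our own assumptions: the function name `cumulative_points` is hypothetical, and the boundaries and percentages are invented rather than taken from Table 4.2.1.

```python
def cumulative_points(boundaries, percentages):
    """Return plotting points for the "less than" and "or more" polygons.

    boundaries  -- ordered true class boundaries; one more than percentages
    percentages -- simple (non-cumulative) per cent frequency of each class
    """
    less_than, or_more = [], []
    total = sum(percentages)
    running = 0
    for i, p in enumerate(percentages):
        # "Or more" values are plotted over the true LOWER boundary.
        or_more.append((boundaries[i], total - running))
        running += p
        # "Less than" values are plotted over the true UPPER boundary.
        less_than.append((boundaries[i + 1], running))
    return less_than, or_more


# Hypothetical distribution: four classes of width five.
lt, om = cumulative_points([0, 5, 10, 15, 20], [10, 20, 40, 30])
print(lt)
print(om)
```

In this example the "less than" values rise from 10 to 100 as the boundaries increase, and the "or more" values fall from 100 to 30, which is the diagonal pattern described above.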

When one or both ends of the frequency table are open, the CFP will
not extend over the entire range of the frequency scale and thus will appear
incomplete. In that event, it may be possible to close the tabular distribution
at some convenient point without doing violence to the data, thereby
allowing the ogive to be completed. For example, we may close the United
States age distribution at 100 years without injustice to the facts, since
only a tiny fraction of all individuals exceed the century mark. To close
the cumulative polygon at 100, we would therefore require only several
additional five-year intervals on the base line. On the other hand, in

2. (a) What problems arise from unequal intervals in constructing a CFP?

(b) Are these problems identical with those of the simple frequency polygon?

3. (a) Would it be possible to plot a cumulative column diagram, analogous to
the histogram? Illustrate.

(b) When would such a figure be particularly appropriate?

(c) What are its limitations?

4. Is it possible to deduce the shape of the simple frequency polygon from the
cumulative polygon? Explain and illustrate.

5. Explain why the "more-than, less-than" terminology creates difficulties when
applied to discrete data.

Table 4.2.3  Divorces and Annulments by Number of Years Married,
Percentage Distribution, £S States, 195S

Years Married          Number      Per Cent

Under 1 year            9,260        7.0%
1 year                 13,485       10.2
2 years                13,067        9.8
3 years                10,790        8.2
4 years                 9,549        7.2
5 years                 9,291        7.0
6 years                 8,978        6.7
7 years                 8,107        6.0
8 years                 5,293        4.0
9 years                 4,029        3.0
10 years                3,905        2.9
11 years                4,037        3.0
12 years                3,557        2.7
13 years                3,032        2.3
14 years                2,505        1.9
15 years                2,187        1.6
16 years                2,283        1.7
17 years                2,128        1.6
18 years                1,831        1.4
19 years                1,714        1.3
20-24 years             6,137        4.6
25-29 years             4,129        3.1
30-34 years             2,073        1.6
35-39 years             1,007         .7
40 years and over         686         .5

Total                 133,103      100.0%

Source: U.S. National Office of Vital Statistics, Vital Statistics of the U.S., Special
Reports, Divorces and Annulments, Vol. 42, No. 2, 1955, Table 5, U.S. Government
Printing Office, Washington, D.C., 1953.




lar to the base line, which fixes the first quartile, 13.7. This is the age below
which 25 per cent of the cases lie. The second and third quartiles are found
analogously. Centile values may be similarly obtained.* Thus, to find
the 90th centile value, C90, we drop a perpendicular to the base line from
a point on the curve that represents a "less than" cumulated frequency
of 90 per cent; the required value is then read off the base line at the
junction point.

By reversing the foregoing procedure, the percentage above (or below)
any particular value may be quickly established. We may seek, for
instance, the percentage of the population under 21 years of age. We first
locate 21 on the base line, at which point we erect a perpendicular to the
curve. From that intersection we extend a perpendicular to the frequency
axis, where we find that the percentage of persons under 21 years of age
is 36 per cent.
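
Because the ogive is made up of straight segments, both graphic readings correspond to linear interpolation between neighboring plotted points. The sketch below is an illustration only; the function names `value_at` and `percent_below` are hypothetical, and the points are the first few "less than" entries of Table 4.2.1, with the origin added as a starting point.

```python
def value_at(points, pct):
    """Base-line value below which `pct` per cent of the cases lie."""
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if p0 <= pct <= p1:
            if p1 == p0:  # flat segment: any value in it would serve
                return x0
            return x0 + (x1 - x0) * (pct - p0) / (p1 - p0)
    raise ValueError("percentage lies outside the plotted ogive")


def percent_below(points, value):
    """Cumulative per cent of cases lying below `value` on the base line."""
    for (x0, p0), (x1, p1) in zip(points, points[1:]):
        if x0 <= value <= x1:
            return p0 + (p1 - p0) * (value - x0) / (x1 - x0)
    raise ValueError("value lies outside the plotted ogive")


# (upper boundary, "less than" cumulative per cent), ages 0 to 25 only.
ogive = [(0, 0.0), (5, 10.7), (10, 19.5), (15, 26.9), (20, 33.9), (25, 41.5)]

print(round(value_at(ogive, 25), 1))       # first quartile
print(round(percent_below(ogive, 21), 1))  # per cent under age 21
```

Arithmetic interpolation of this kind and the graphic reading should agree to within the accuracy with which the curve can be drawn and read.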

It is the ease with which such graphic solutions may be obtained that
gives to the cumulative frequency polygon something of an advantage over
the cumulative frequency table, which would necessitate tedious interpolation.

Table 4.2.2  Distribution of Families by Income, U.S., 1955

Family Income        Number of Families     Per Cent
                     (in Thousands)

Under $1,000              3,300               7.7%
$ 1,000- 1,999            4,500               9.8
  2,000- 2,999            4,700
  3,000- 3,999            6,500
  4,000- 4,999            6,600
  5,000- 5,999            5,400
  6,000- 6,999            4,100
  7,000- 9,999            5,500
 10,000-14,999            2,100
 15,000 and over            600               1.4

Total                    42,500             100.0%


Questions and Problems

1. Define the following concepts:

Cumulative Frequency
Ogive
Quartile
Centile
Quantile

* Centiles will be discussed further in Chapter 5, Section 2.



Since the tabulated attributes have only one statistical dimension,
namely, the frequency dimension, the corresponding bar chart requires
only one scale, the frequency scale. This scale is usually marked off along
the horizontal base line. Its range is of course determined by the largest
frequency. The attributes are set down by name alongside the bars, and
they are easily read when the bars are horizontally placed.

Since there is no continuous quantitative scale (as there is in the histogram)
to which the base of the bars must be fitted, the bars may be of any
convenient width, placed in any plausible order, and left physically unconnected.
But by arranging bars from longest to shortest, the attributes
are effectively ranked according to the frequency of their occurrence. The
bars are drawn to uniform width merely to maximize the eloquence of the
diagram; the dead space between them (ordinarily one-half bar width)
serves to give emphasis to the discrete character of qualitative data.

The Pie Chart. As the term suggests, the pie chart (Figure 4.3.1b) is a
circular figure in which the areas of the respective slices are drawn proportional
to the attribute frequencies. In constructing this chart, percentage
frequencies are successively measured with a protractor along the circumference
of 360°, or 100 per cent. From these cutting points, we draw radii
to the center of the circle, which partition the total area into proportional

Figure 4.3.1b Pie Chart, Religion Reported, U.S. Population, 1957


6. (a) Construct "less than" CFP's of the East and West suicide rates on one
graph (Problem 14, p. 85).

(b) Interpret the difference between the two contours. Are comparisons more
effective by means of cumulative or simple frequency graphs?

7. Plot Table 4.2.2 as an "or more" CFP.

(a) How could the lower end be closed?

(b) How many class intervals would be required to close the upper end?

(c) From the graph, find the values that enclose the middle 50 per cent of
the families.

(d) What per cent of all families receive $8,000 or more?

8. (a) Cumulate the per cent frequencies shown in Table 4.2.3 in both directions
and prepare corresponding cumulative polygons on the same set of axes.
Verify that the graphs intersect at the 50 per cent marker.

(b) What percentage of divorces occurs after 10 years of marriage?


Section Three 

Graphs of Qualitative Data

The graphic versions of qualitative data differ from those of quantitative
data in certain details, but not too drastically in fundamental principles.
All graphic portrayals of qualitative data use length, area, or intensity of
shading to represent frequency. Only three of the simplest types commonly
encountered are here presented: (1) the bar chart, (2) the pie chart, and
(3) the statistical map.

The Bar Chart. The bar chart (Figure 4.3.1a) is simply a succession of
evenly spaced bars whose lengths are proportional to attribute frequencies.


Figure 4.3.1a Bar Chart, Religion Reported, Sample of U.S. Population, 1957

may be produced by variegated cross-hatching or by graded concentrations
of simple dots. There are innumerable picturesque devices designed to
exhibit the diversity among the mapping units. These are described in
detail in specialized handbooks on the subject of graphic presentation and
are not elaborated here.

The principal problem in geographic mapping is a clean definition of
the spatial unit. Units such as cities or states usually offer few difficulties,
since the boundaries of these units are fixed by law. But such units as
city blocks, cultural areas, national regions, types of residential districts,
and natural ecological areas interpose many ambiguities which must be
resolved before their meaning will be adequately conveyed. In spite of
such problems, the social base map is an important tool of social investigation;
in fact, it is all but essential to the ecological school of sociology,
which focuses on the spatial aspects of social behavior. The social base
map consists of those outlines considered essential to the presentation and
understanding of the plotted social data. Thus, the uneven territorial
distribution of alcoholic psychosis is effectively shown in Figure 4.3.2,
and poses significant sociological issues in regard to its etiology.

Questions and Problems 

1. Define the following concepts:

Bar Chart          Statistical Map
Pie Chart          Social Base Map

2. Describe the distinctive features of graphs of qualitative data.

3. Criticize Figure 4.3.3 as a graph of qualitative data.


wedges. The effectiveness of the pie chart, or for that matter all qualitative
charts, may often be enhanced by judicious and tasteful shading and
coloring.
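
The protractor measurements used in laying out a pie chart amount to converting each percentage into its share of 360 degrees. A minimal sketch follows, in which the function name `pie_cutting_points` and the four percentages are hypothetical and are not the values of Figure 4.3.1b.

```python
def pie_cutting_points(percentages):
    """Return (slice angle, cumulative cutting point) in degrees per slice."""
    result = []
    cumulative = 0.0
    for pct in percentages:
        angle = pct * 360 / 100  # one per cent corresponds to 3.6 degrees
        cumulative += angle
        result.append((angle, cumulative))
    return result


# Hypothetical percentage frequencies for four attributes (sum to 100).
for angle, cut in pie_cutting_points([50, 25, 15, 10]):
    print(angle, cut)
```

The cumulative column gives the successive positions at which radii are drawn; the last cutting point returns to 360 degrees when the percentages sum to 100.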

The Statistical Map. The geographical variation in such items as population
composition, marriage and divorce rates, crime rates, and economic
indexes is always instructive to the social analyst. The systematic display
of such variation may serve the purposes of social policy, as well as of
scientific explanation. Thus, the discovery of a relatively high death rate
in a certain geographic area, or the concentration of crime and delinquency
in specified urban districts, constitutes a first step in uncovering the causes
of the phenomenon.

The map is the most natural vehicle for the purpose of setting forth
geographic variation. The general principle of statistical mapping is to
symbolize varying frequencies by appropriate density of shading, which



Figure 4.3.2 Alcoholic
Psychosis Rates per 100,000
Adult Population, Chicago,
I9tg~i9Sl • 


Urban Areas, The University




Table 4.3.4  Per Cent Non-White Population by State, U.S., 1950

State                     Per Cent

New England
  Maine                      0.3
  New Hampshire              0.2
  Vermont                    0.1
  Massachusetts              1.7
  Rhode Island               1.9
  Connecticut                2.7
Middle Atlantic
  New York
  New Jersey                 6.7
  Pennsylvania               6.1
East North Central
  Ohio                       6.5
  Indiana                    4.5
  Illinois                   7.6
  Michigan                   7.1
  Wisconsin                  1.2
West North Central
  Minnesota                  1.0
  Iowa                       0.8
  Missouri                   7.6
  North Dakota               1.8
  South Dakota               3.7
  Nebraska                   1.8
  Kansas                     4.0
East South Central
  Kentucky                   6.9
  Tennessee                 16.1
  Alabama                   32.1
  Mississippi               45.4
South Atlantic
  Delaware                  13.9
  Maryland                  16.6
  District of Columbia
  Virginia
  West Virginia              5.8
  North Carolina            26.6
  South Carolina            38.9
  Georgia
  Florida                   21.8
West South Central
  Arkansas                  22.4
  Louisiana                 33.0
  Oklahoma                   9.0
  Texas                     12.8
Mountain
  Montana                    3.3
  Idaho                      1.2
  Wyoming                    2.2
  Colorado                   2.1
  New Mexico                 7.5
  Arizona                   12.7
  Utah                       1.8
  Nevada                     6.4
Pacific
  Washington                 2.7
  Oregon                     1.6
  California                 6.3

From U.S. Bureau of the Census, U.S. Census of the Population: 1950, Vol. II,
Characteristics of the Population, Table 59, U.S. Government Printing Office,
Washington, D.C., 1953.


4. Represent Table 4.3.1 by a bar chart.

(a) Should the width of the bars be adjusted to the unequal age intervals?

(b) How should the tabulation be closed?

(c) Explain the declining percentage of high school graduates with increasing
age.

(d) How should bars be ordered?




Table 4.3.1  Percentage of Specified Age Groups Completing High School,
U.S., 1950

Age                Per Cent

14-17                 2.4
18-24                50.3
25-34                49.1
35-44                37.2
45-54                27.7
55 and over          19.2

Source: U.S. Bureau of the Census, U.S. Census of the Population: 1950, Vol. IV,
Special Reports, Education, U.S. Government Printing Office, Washington, D.C.,
1953.


Table 4.3.2  Percentage of Specified Income Groups Completing High School,
Males 25 and Over, U.S., 1950

Income in 1949     Per Cent

Less than $500       15.0
$ 500-1,999          18.6
2,000-2,999          28.0
3,000-3,999          39.3
4,000-4,999          43.4
5,000-6,999          68.5
7,000 and over       60.5
Not reported         24.5

Source: U.S. Bureau of the Census, U.S. Census of the Population: 1950, Vol. IV,
Special Reports, Education, U.S. Government Printing Office, Washington, D.C.,
1953.


Table 4.3.3  Rate of Plural Births by Age of Mother, White Population, U.S.,
1944

Age of Mother      Rate per
                   100,000 Births

10-14                 331.1
15-19                 549.6
20-24                 774.8
25-29               1,019.8
30-34               1,310.7
35-39               1,589.3
40-44               1,203.7
45-49                 721.0
50 & over               0.0

All Ages            1,015.2

Source: U.S. National Office of Vital Statistics, Vital Statistics of the U.S., Special
Reports, National Summaries, Plural Birth Statistics, U.S. and Each State, 1944,
Vol. 25, No. 18, Table D, U.S. Government Printing Office, Washington, D.C., 1947.



Figure 4.4.1 Time Chart, Divorces per 100 Marriages, U.S., Selected
Years, 1870-1955 * (horizontal axis: Year)


rule may actually operate to conceal the very fluctuations which the graph
hopes to portray. Thus, the trend line on the scale provided in Figure
4.4.2a displays only gentle fluctuations in the annual United States death
rate. But such fluctuations are probably more significant than the coarse
scale seems to indicate. This discrepancy may be righted by the insertion
of a finer scale which throws the annual vacillations into higher relief
(Figure 4.4.2b). Now, in order to "preserve" the zero origin and still
not enlarge the graph to unwieldy proportions, we introduce a break on
the vertical axis. A time chart delineating the monthly variation in the
United States birth rate uses this little device to good advantage (Figure
4.4.3). What we have done, in effect, is to place the business part of the
graph under a microscope in order to magnify the trend line.

However, this is not always an ideal solution. While it is true that by
means of this technique relatively small variations may be made more

* From U.S. Bureau of the Census, Statistical Abstract of the U.S., 1957, U.S. Government
Printing Office, Washington, D.C., 1957, p. TX




5. (a) Plot Table 4.3.2 as a bar chart.

(b) Income usually increases with age. As of 1950, the higher income groups
show higher educational levels; the older age groups show lower educational
levels. How, then, do you account for the increasing percentage of high
school graduates in the higher income groups?


6. Prepare a bar chart in vertical position from the data of Table 4.3.3. Draw a
line parallel to the horizontal axis through the over-all average of 1,015.2.

7. Prepare a statistical map of the United States to show the percentage of non-white
population by state (Table 4.3.4). Use the following shading: states
under 15 per cent non-white, white; 15-29 per cent, light gray; 30-39 per cent,
dark gray; 40 per cent and over, black.


Section Four
Time Graphs

Time series are ordinarily not presented as tables, but are rather converted
into graphs, for the simple reason that extended time series are
difficult to read as tables, and extremely short series are not significant
anyway. The direction, rate, and fluctuation of a trend may be much
more readily discerned from a graph than from a series of numbers.

Time graphs are of two prevalent forms: (1) arithmetic charts; and
(2) semi-logarithmic (or simply "semi-log") charts.

The Arithmetic Time Chart. This is analogous to the simple frequency
graph, except that we plot quantitative observations by intervals of time
instead of plotting frequencies on class intervals. The intervals of time are
plotted along the base line, and the variable whose fluctuations in time
are being recorded is plotted on the vertical axis. In Figure 4.4.1, which
exhibits the recent trend in the divorce rate in America, the base line
represents the period from 1870 to 1960, and the vertical scale ranges from
0 to 35 divorces per 100 marriages.

The time scale should be laid out so as to accommodate comfortably
uniform time intervals in the series; the vertical scale, beginning with the
zero origin, should similarly accommodate the largest quantity in the
series. Entries should, of course, be plotted over the midpoints of the time
intervals within which the observation is made.

When the time series is composed of magnitudes or frequencies which
move within a narrow range, rigid adherence to the forenamed zero-origin



Figure 4.4.3 Time Chart, Monthly Variation in Birth Rate, U.S.,
1949-1956 *


of the time series. An illustration of that technique is given in Figure 
4.4.4, ^here eucceesi\*e columns symbolize the number of marriages in 
the United States each month. 

The time graph, which is merely a calendrical series of observations,
should not be confused with a bivariate distribution (such as years
married and family size) in which the base line represents duration of
time. When time is a genuine variable in such a bivariate distribution,
it has a true zero point, as in age or any other measure of duration which
may be replicated indefinitely. In the time series, however, the markers
are no more than arbitrary tags for unique moments in time. Nevertheless,
such instants in time are often treated as durations in order to fit
a smooth trend line to the observed data.

Multiple Graph. Under some circumstances, it will be possible to plot
two or more time series to the same arithmetic scale in order to exhibit
more clearly the relation between them. Thus, in Figure 4.4.5, the
relation between the divorce rate and the employment rate of married women
is rendered more vivid by this device. This graph is readily
comprehensible because the two variables have approximately the same range and
general location on the scale.

However, this procedure must be undertaken with some caution, for
it may mislead the reader when variables are not identically located. In
Figure 4.4.6, the chart may create the impression that in 1945-46 the

* From U.S. National Office of Vital Statistics, Vital Statistics Special Reports,
Vol. 48, No. 14, 1958, U.S. Government Printing Office, Washington, D.C., 1958.



Graphic Presentation



Figure 4.4.2a Time Chart (Without Break), Death Rate, U.S.,
1900-1950 *

visible, it is also true that the movement of the trend line is no longer
properly related to the plotted zero origin, which is now an “origin” in
name only. Tinkering with the vertical scale will distort the ratios among
successive entries and thereby exaggerate or minimize the relative
variation. However, the practiced eye may very well be able to compensate
for such visual distortions.

Instead of a trend line, the time chart may consist of a set of separate
columns or bars, whose heights are proportional to the respective values



Figure 4.4.2b Time Chart (With Break), Death Rate, U.S., 1900-1950
* From U.S. National Office of Vital Statistics, Vital Statistics of the U.S., Special
Reports, Summary of Mortality Statistics, 1950, Vol Sr. Nq. S, Table 1, U.S.
Government Printing Office, Washington, D.C., 1953.
Figure 4.4.5 Time Chart, Divorce Rate and Percentage of Married
Women Gainfully Employed, U.S., Selected Years, 1890-1950 *


Before attempting to graph a time series on semi-log paper, the student
should first familiarize himself with the principles underlying the linear
arrangement of the guidelines and the markers attached thereto. Initial
inspection of the semi-log paper is likely to produce a confused impression,
which will be dispelled, however, upon a careful analysis of the logic of
the grid scheme.

A typical grid sheet consists of two or more cycles which are recognizable
as identical linear groupings. The horizontal guidelines within the cycle
are not equally spaced, but are so laid down that any specific numerical
ratio between two markers always corresponds to the same scale distance:
the distance from 2 to 4 is exactly equal to the distance from 1 to 2, 3 to 6,
4½ to 9, 300 to 600, and so on. Consequently, any series of absolute values,

* From Bureau of the Census, Statistical Abstract of the U.S., 1957, U.S.
Government Printing Office, Washington, D.C., 1957, p. 73.



Figure 4.4.4 Time Chart, Variation in [...] Rate, U.S., [...]


marriage rate was rising about 5 times as rapidly as was the divorce rate,
whereas, relatively speaking, it was rising only twice as rapidly. This
false impression results from the fact that the two sets of data are located
at very unequal distances from the zero origin. To compare the rates of
change, we have to allow for this locational difference, but this cannot
readily be done by the eye. It is the function of the semi-logarithmic
chart to transform the scale so as to facilitate comparisons between rates
of change.
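
The contrast between absolute and relative change may be checked by simple arithmetic. The sketch below uses hypothetical rates (illustrative assumptions, not the values plotted in Figure 4.4.6), chosen so that one series gains five times as many points as the other while growing only twice as fast in relative terms.

```python
def absolute_change(start, end):
    """Increment in the units of the vertical (arithmetic) scale."""
    return end - start


def relative_change(start, end):
    """Increment expressed as a proportion of the starting value."""
    return (end - start) / start


# Hypothetical rates for two successive years (assumed for illustration only).
marriage = (10.0, 15.0)   # located far from the zero origin
divorce = (4.0, 5.0)      # located near the zero origin

abs_ratio = absolute_change(*marriage) / absolute_change(*divorce)
rel_ratio = relative_change(*marriage) / relative_change(*divorce)

print("ratio of absolute increases:", abs_ratio)
print("ratio of relative increases:", rel_ratio)
```

On an arithmetic chart the eye registers the first ratio; the second is the one that matters when rates of change are compared.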

The Semi-logarithmic Chart. In the arithmetic time chart, only the
absolute, not the relative, increments of change are directly measurable. Hence,
another graphic device is required to measure relative growth and decline;
or, to put it more technically, to measure the rate of change in a given
variable. This is the semi-logarithmic, or ratio chart. It reveals at a glance
whether the rate of change is constant, or whether that rate itself is
changing. When the rate of change is constant, as in a geometric series, the
graph is a straight line. It is therefore especially appropriate to data of
this form, such as the classic idealized Malthusian population series.
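
Why a geometric series plots as a straight line may be seen by taking common logarithms of successive terms. The following is a minimal sketch; the two series and the helper `log_steps` are assumptions introduced for illustration, not data from the text.

```python
import math


def log_steps(series):
    """Vertical steps between successive points on a logarithmic scale."""
    logs = [math.log10(value) for value in series]
    return [later - earlier for earlier, later in zip(logs, logs[1:])]


# Idealized series (assumed): one doubles each period, one adds 2 each period.
geometric = [2 ** n for n in range(6)]        # 1, 2, 4, 8, 16, 32
arithmetic = [1 + 2 * n for n in range(6)]    # 1, 3, 5, 7, 9, 11

geo_steps = log_steps(geometric)
ari_steps = log_steps(arithmetic)

print([round(step, 4) for step in geo_steps])
print([round(step, 4) for step in ari_steps])
```

The geometric series rises by the same vertical step in every period, which is what a straight line means; the arithmetic series rises by ever smaller steps and therefore bends toward the horizontal on the ratio chart.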



Figure 4.4.6 Time Chart, Fluctuation in Marriage and Divorce Rate,
U.S., [...]-1956 *


which are plotted on this ratio scale, will automatically show the relative
size of successive increments. The student should experimentally
familiarize himself with the logarithmic scale by marking off on a strip of paper
the distance from, say, 1 to 2 and checking it against other identical
ratios, such as 2 to 4, 3 to 6, etc.
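
The strip-of-paper check can also be made numerically. The sketch below assumes only that a position on the ratio scale is proportional to the common logarithm of the marker; the function name `scale_distance` is introduced here for illustration.

```python
import math


def scale_distance(lower, upper):
    """Distance between two markers on a logarithmic scale, in cycle units."""
    return math.log10(upper) - math.log10(lower)


# The pairs named in the text; each stands in the ratio 1 to 2.
pairs = [(1, 2), (2, 4), (3, 6), (4.5, 9), (300, 600)]
distances = [scale_distance(lower, upper) for lower, upper in pairs]

for (lower, upper), distance in zip(pairs, distances):
    print(f"{lower} to {upper}: {distance:.4f} of a cycle")
```

Every pair yields the same distance, about three tenths of a cycle, whatever its absolute location on the scale.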

Another difference between the ratio and arithmetic scales lies in the
numerical origin, which, in the former, must be a value greater than zero.
However, for purely practical reasons, it is usually designated as a power
of 10: 1, 10, 100; .01, .001, and so on. Since the semi-logarithmic chart
measures relative changes, it is impossible to use the zero origin; for any
value multiplied by zero yields zero, and hence would never get off the
base line. The Malthusian population could never multiply if it originated
with zero.


The mathematical foundation of the ratio chart (Figure 4.4.7) is the
principle of common logarithms to the base 10. However, it will not be
necessary for the student at this time to be intimately familiar with the
procedure of ruling off a logarithmic scale. Prefabricated sheets are
commercially available, and the student need merely plot the absolute data
from the tabulation.


The markers are entered on the scale in multiples of 10 so as to
accommodate [...] data [...]
the markers would be written so that 3 = 30, 4 = 40, and so forth.
If the range extends from 30 to 120, two cycles would be needed, the second
beginning with 100 and extending upward to 1,000.
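
The number of cycles required for a given range can be worked out in advance. The sketch below assumes that each cycle runs from one power of 10 to the next; `cycles_needed` is a hypothetical helper, not a procedure given in the text.

```python
import math


def cycles_needed(lowest, highest):
    """Return (number of cycles, origin of the scale, top of the scale).

    Assumes every cycle spans one power of 10 and that both values are positive.
    """
    if lowest <= 0 or highest < lowest:
        raise ValueError("values must be positive, with lowest <= highest")
    bottom = math.floor(math.log10(lowest))           # power of 10 at or below the lowest value
    top = max(math.ceil(math.log10(highest)), bottom + 1)
    return top - bottom, 10 ** bottom, 10 ** top


print(cycles_needed(30, 120))
```

For the range 30 to 120 this gives two cycles, the first running from 10 to 100 and the second from 100 to 1,000, in agreement with the example above.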

* From U.S. National Office of Vital Statistics, Vital Statistics of the U.S., Special
Reports, Marriages and Divorces, [...], Vol. 48, No. [...], U.S. Government
Printing Office, Washington, D.C., [...], Table 1.


Figure 4.4.7 Semi-logarithmic Chart, College Enrollment and Population,
Ages 18-21, U.S., [...]-1957, Projected to 1975 *


In Figure 4.4.7, the college-age population and the actual college
enrollments are plotted on the ratio scale. It is easy to read off the semi-log
chart that the college-age population has remained almost stationary, but
that college enrollments have more than doubled between 1930 and 1950.
In this case, the disproportionate increase in enrollment, as related to
population growth, stands forth in bold relief. No arithmetic chart, much
less a tabulation, could so clearly project the historical facts. Although
we could laboriously obtain the absolute figures, we are here not primarily
interested in such arithmetic data, nor is the chart adapted to such readings.
It is, however, expressly designed to reveal relative changes in the trend.
Hence, it is uniquely suited to compare relative shifts either within a given
trend, or in two or more trends, irrespective of any divergence in absolute
values.

In Figure 4.4.8, the trend of maternal mortality for non-whites is readily
compared with the trend for whites, and indicates that the maternal
mortality has declined at about the same rate for both color groups. And

* From Metropolitan Life Insurance Co., Statistical Bulletin, Vol. 38, August, 1957.

Figure 4.4.8 Semi-logarithmic Chart, Maternal Mortality Rate,
White and Non-White, Birth Registration States, 1915-1956 *

even when the trends are located in widely separated belts, it may still be
found convenient to plot them in identical cycles, setting up the appropriate
multiple scales on both left and right vertical axes, always assuming that
the disposition of the curves does not lead to confusion rather than clarity
(Exercise 9, p. 108).

As in other comparisons, the juxtaposition of two or more sets of data
may suggest “causative” explanations which may be checked by further
investigation. However, the sociographic picture, as here portrayed, is
only the first stage in the analysis; it supplies an indication of the need
for further exploration. Obviously, in the above instance population
change does not account for the increase in college enrollment.

* From U.S. National Office of Vital Statistics, Vital Statistics of the U.S., Special
Reports, National Summaries, 1956, Maternal Mortality, Vol. 48, No. 15, Table A,
U.S. Government Printing Office, Washington, D.C., 1958.


The proper interpretation of the semi-log chart assumes an appreciable
amount of statistical training. However, even with a modicum of
experience a few simple curve-patterns may be identified: (1) a sloping straight
line indicates change at a constant rate; (2) an upward concave curve, change
at an increasing rate; and (3) an upward convex pattern, change at a
decreasing rate (Figure 4.4.9).
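
The three patterns can be told apart by comparing the successive steps of the logarithms. The sketch below is one way of doing so, assuming a rising series observed at equal time intervals; the function `semi_log_pattern`, its tolerance, and the three sample series are illustrative assumptions.

```python
import math


def semi_log_pattern(series, tol=1e-9):
    """Classify a rising, equally spaced series by its shape on a ratio chart."""
    logs = [math.log10(value) for value in series]
    steps = [later - earlier for earlier, later in zip(logs, logs[1:])]
    bends = [later - earlier for earlier, later in zip(steps, steps[1:])]
    if all(abs(bend) <= tol for bend in bends):
        return "straight line: constant rate"
    if all(bend > tol for bend in bends):
        return "upward concave: increasing rate"
    if all(bend < -tol for bend in bends):
        return "upward convex: decreasing rate"
    return "mixed pattern"


# Hypothetical series: growth of 10% throughout; 5%, 10%, 20%; and 20%, 10%, 5%.
constant = [100, 110, 121, 133.1]
speeding = [100, 105, 115.5, 138.6]
slowing = [100, 120, 132, 138.6]

for series in (constant, speeding, slowing):
    print(series, "->", semi_log_pattern(series))
```

A series that rises in some periods and bends both ways in others falls into none of the three simple patterns and is reported as mixed.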



Figure 4.4.9 Semi-logarithmic Curve Patterns: panels (1), (2), and (3)


Questions and Problems 

1. Define the following concepts: 

Zero-Origin Rule 
Scale Break 
Multiple Graph 
Arithmetic Scale
Arithmetic Time Chart
Ratio Chart
Logarithmic Scale
Cycle
Constant Rate of Change
Changing Rate of Change

2. Discuss the hazards involved in plotting two or more time series on the same
arithmetic chart.

3. In Figure 4.4.1, in what sense is the trend line inferential rather than descriptive?

4. Construct simple time charts for the time series given in Table 4.4.1.

5. Plot the birth rates of India and the United States (Table 4.4.2) on a single
arithmetic time chart. Interpret.

6. Interpret the following semi-log trend lines:

(a) horizontal line

(b) steeply sloping straight line 

(c) concave sloping downward 



Table 4.4.1  Cancer Mortality Rates, White Males and Females, Ages 1-74,
Metropolitan Life Insurance Industrial Policy Holders, 1911-1955

                  Deaths per 100,000 **
Year              Male        Female

1911-1915         66.6        91.4
1916-1920         68.4        89.9
1921-1925         75.[?]      88.0
1926-1930         80.6        89.0
1931-1935         85.8        89.0
1936-1940         86.8        69.0
1941-1945         85.4        82.5
1946-1950 *       92.3        79.9
1951-1955 *       97.8        75.6

* Based on 6th Revision of the International List of Causes of Death.

** Death rates standardized for age.

Sources: Metropolitan Life Insurance Company, Statistical Bulletin, July [...];
F. A. L[...] and M. P[...], “The Mortality Experience of Industrial
Policyholders,” Society of Actuaries Transactions, Vol. [...], May 1957.


7. Why is the semi-log chart suited to the comparison of rates of
growth?

8. Why is the Malthusian population series called a geometric series?

9. Plot the time series shown in Table 4.4.3 on semi-log paper, using appropriate
multiple scales.




Table 4.4.3  Census of Population (in Thousands), California, Kansas,
and San Diego, Decennial Years, 1890-1950

Year        California      Kansas      San Diego

1890           1,213         1,423           16
1900           1,485         1,470           18
1910           2,378         1,691           40
1920           3,427         1,769           74
1930           5,677         1,881          148
1940           6,907         1,801          203
1950          10,586         1,905          334

Source: Bureau of the Census, Statistical Abstract of the U.S., 1955,
U.S. Government Printing Office, Washington, D.C., 1956, Tables 8 and 11.



Averages 


Section One

The Mode 


Measures of Location. The frequency distributions of quantitative data
which have been previously analyzed constitute condensations of large
masses of observations. Basically, they are nothing more than series of
values deployed on a continuous scale. We may therefore declare that
the items are located on a segment of the scale. Such a description becomes
particularly meaningful when we compare two or more distributions which
are located in different regions of the same scale. Thus, when husbands
and wives are classified by age and located on the same age scale, the
husbands, who are usually older, occupy the upper end of the scale, and
wives, being younger, occupy the lower end. Similarly, whites and
non-whites in the United States are differently located on the wage continuum
(Figure 5.1.1). It is clear that the statistical concept, location, is
concerned only with quantitative variables, and is not applicable to qualitative
data, which have no scale to be located on.

But when making these comparisons, it is not always practical to quote
or depict the full distribution, however compactly it may be presented in
tabular or graphic form. For many purposes, the complete table is
unnecessarily detailed, too cumbersome to manipulate, and not always
readily comparable to other tables of analogous data. In accordance with
the general function of statistics to simplify large masses of data, we need
a more condensed statement that will (1) provide information on the
locational value in which we are interested, (2) eliminate those values
which are at the moment irrelevant, and (3) will still faithfully represent
the totality with reasonable efficiency.

The limit of such efficient condensation would obviously be the reduction
of the multitude of items to one single value which would, in some
[...], represent the entire aggregate. But it is clear that no single value is
sufficiently versatile to reflect every characteristic of location of a
distribution; instead, it can reflect only one feature of it. The representative
value is not, therefore, a replica or miniature of the total; it is rather a
[...] value of limited utility which will “do the work” for the totality. When
this task is to fix the location of the distribution along the scale, then the
value is called a measure of location.

Any value in the distribution could serve to represent the [...] if
we knew the position of that item in the full array. Hence, we must
analyze all of the items in the aggregate in order to determine the
representativeness of any one. But, in practice, not all values will be equally
serviceable, even though their representativeness has been determined; rather
three types of values have been found most convenient and useful to
abstract from a tabulation: (1) the maximum, (2) the minimum, and
(3) the central or typical values, known as averages.

Figure 5.1.1 Annual Family Income, Percentage Distribution,
White and Non-White, Urban U.S., 1955 *
[Horizontal axis: income in thousands of dollars]

* From Bureau of the Census, Current Population Reports, Consumer
Income, U.S. Government Printing Office, Washington, D.C., 1957.

The Maximum. There are occasions when the maximum value in a
distribution is the only useful one. In the fluctuations of traffic volume and
weight, it is the peak load which must be safely accommodated on the
highway or bridge; the “average” is not a satisfactory guide, since the
bridge built for the average load would collapse under the maximum
traffic. Similarly, the capacity of schools, hospitals, and other institutions
is planned for the approximate maximum, not for the anticipated average
patronage. Middle values are not safe for such purposes. The maximum,
therefore, as a measure of location, constitutes that value below which lie
the temporarily irrelevant values.

The Minimum. Many problems of social policy are resolved by selecting
the minimum value of a distribution as a working norm. Obviously, the
minimum is that value above which all other values in the array may be
found. From the distribution of incomes of healthy families, the minimum
income is selected for legislated welfare norms. From the hypothetical
distribution of mature individuals by age, the minimum age of eligibility
is set for marriage, army service, voting, and other social responsibilities.
A prospective college student may be interested only in the minimum of
the array of college expenditures and use it as a guide in making his plans.
The other values are disregarded.

Neither the maximum nor minimum offers serious computational or
statistical complications either in reckoning or interpretation. They are
simple in conception; hence, they require no extended discussion.

The Concept of Average. Any value between the extremes could be selected
as a locational measure. However, the most common intermediate
locational value, and in general the most useful, is an average. This is generally
a central value around which the distribution seems to cluster. This
apparent tendency of many statistical aggregates to concentrate around
a center is often termed “central tendency,” the value at this center being
the measure of central tendency, more commonly called average.

This functional center, however, is not necessarily identical with the
middle region of the range of observed data. The region of concentration
may be either near the midpoint of the range or at a considerable distance
from it. Thus, the distribution of intelligence scores resembles a
bell-shaped curve which is centered on the midpoint of the range. On the other
hand, the distribution of wages, sizes of cities, and other social variables
may [...] whose centers are displaced toward the
edges of the range, as in the sketches in Figure 5.1.2.

Figure 5.1.2 Types of Frequency Curves

As in common usage, so also in statistical language, the concept
“average” connotes the typical, the ordinary, and expected. Like most other
statistical measures, the average has its roots in common experience and
is indispensable to daily discourse. Every layman summarizes and
simplifies the whole range of his experiences by speaking quite casually about
the average voter, the average family, the average student, a batting
average, or may refer to almost any experience as “just average.” The
wide variety of traits which are reducible to an average is shown in such
popular descriptions as that of the “average” American male who “stands
five feet nine inches, weighs 158, prefers brunettes, baseball, beefsteak, and
French fries, and thinks the ability to run a home efficiently is the most
important quality of a wife.”

There is, of course, no male who is average in every respect: the person
of average height will not necessarily be a person of average intelligence
or average handsomeness. Therefore, the popular claim that the “average
person” does not exist is completely justified. Such unrealistic expressions
as “average school” and “average Negro” do not conform to the stricter
statistical concept which applies only to a series of measures on a single
variable.

Nevertheless, in whatever manner he employs the concept, the layman
unconsciously implies what every statistician explicitly recognizes: that
the average is a kind of norm around which the values tend to vary. But
the difference between the layman and the professional is that the latter
requires more precision than is provided by informal folk usage. He
therefore devises mathematical procedures to measure the average, and
thereby restricts himself to variables whose central tendency is reducible
to some form of quantification. Since there are various types of central
tendency, he necessarily formulates various averages, each answering to
the requirements of a given problem. Of these numerous averages, only
three are here discussed: the mode, the median, and the arithmetic mean.
The arithmetic mean is the best known and most widely used in statistical
work; however, the mode and the median are conceptually more
elementary and for that reason will be taken up first.


The Mode, or Probability Average

The mode is simply the most frequently recurring value in an ordered
distribution; it is that value where the concentration of items is most
dense. Etymologically, it is related to the notion of the prevailing fashion
of dress or etiquette to which a majority of a given social class would be
expected to conform. Hence, the mode (Mo) may also be defined as the
most probable value, and therefore distinctively labeled the probability
average.

An examination of popular expressions suggests that the mode is, in
fact, often implied by the concept “average.” Such usage stems, in part,
from the fact that attributes as well as variates may take on predominant
frequencies in a series of observations. When a politician refers to the
“average voter” as wanting his interests protected, he usually means that
a majority of voters are motivated by self-interest; or when a waitress
remarks that the “average customer” does not drink his coffee black, she
is likely to mean that most restaurant patrons do not drink black coffee.
Restaurant patrons and voters, in their turn, speak of the “average
waitress” and the “average politician.”

Statistically speaking, the mode is the most likely value to be met with
in a series of recorded observations. Suppose, for example, a citizen of
Peoria were to estimate the suicide rate of his community, having in his
possession only the table of suicide rates for the group of 107 cities (Table
3.1.1d). His best guess in this instance would be the most frequently
occurring rate, somewhere between 8.5 and 11.5. By selecting the interval
with the largest single frequency (28 per cent), to that extent, he would
more likely be right than wrong. The probability of success would depend,
of course, on the degree of preponderance of the most frequently occurring
value in the given distribution. If, instead, the preponderant
frequency were 15 per cent or 75 per cent of the total, the probability of
being correct would be diminished or increased accordingly. But the modal
value would still remain the same, irrespective of the size of the plurality
of the most numerous category. Thus, the mode is the most probable
value one would encounter, but it does not reveal the degree of that
probability. Clearly, if the frequencies of every value were identical there would
be no mode.

Calculation of the Mode. Since the mode is the most frequently occurring
value, it is obviously necessary to count the number of occurrences of each
value. In the case of continuous data, it is conceivable that empirical
measurements would be so minute and discriminating that no two cases
in the set would be found to be identical in value. An examination of the
107 suicide rates reveals that very few rates are indeed identical when they
are measured to two decimal places. If, however, a little precision is
sacrificed by rounding and grouping, a mode very quickly emerges. In
fact, a mode is impossible without grouping, since without grouping there
would be no frequencies. It goes without saying that class intervals must
be identical in width; otherwise one could, by making a particular interval
large enough, obtain an apparent mode of almost any desired value, a
clearly meaningless result.

There are, then, two distinct steps in the determination of the mode:
(1) locate the predominant, or modal, frequency, and (2) find the value
corresponding to that frequency. This is illustrated in Tables 5.1.1a
and 5.1.1b.


Table 5.1.1a  Computation of Crude Mode

Class Interval        f

$1-2                  5
 3-4                  6
 5-6                  2
                     --
                     13

Mode = Mdpt. of $2.50 - 4.50
     = $3.50


Table 5.1.1b  Computation of Crude Mode (Data Regrouped)

Class Interval        f

$1                    2
 2                    3
 3                    5
 4                    1
 5 }
 6 }                  2
                     --
                     13

Mode = Mdpt. of $2.50 - 3.50
     = $3.00




In Table 5.1.1a, the highest frequency is 6, and the modal value, or
crude mode, is $3.50, which is the midpoint of that interval, 2.5-4.5.
However, with the finer grouping of Table 5.1.1b, the highest frequency is 5,
and the midpoint of the corresponding interval is $3.00. It immediately
becomes evident that, since there is some flexibility in the choice of class
width, there will ensue a certain instability in the resulting mode.
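
The two steps, and the instability that follows from regrouping, can be reproduced in a few lines. The sketch below is illustrative only: each class is written as (true lower limit, true upper limit, frequency), the function `crude_mode` is a name introduced here, and the frequencies for the $4, $5, and $6 classes of the finer grouping are assumptions (1, 1, and 1) chosen to agree with the class totals of Table 5.1.1a.

```python
def crude_mode(classes):
    """Midpoint of the class with the largest frequency.

    `classes` is a list of (true_lower_limit, true_upper_limit, frequency)
    tuples of equal width; if two classes tie, the first one is taken.
    """
    lower, upper, _ = max(classes, key=lambda item: item[2])
    return (lower + upper) / 2


# Table 5.1.1a: three classes, each two dollars wide.
table_a = [(0.5, 2.5, 5), (2.5, 4.5, 6), (4.5, 6.5, 2)]

# Table 5.1.1b: the same 13 cases in one-dollar classes.
# The last three frequencies are assumed for illustration.
table_b = [(0.5, 1.5, 2), (1.5, 2.5, 3), (2.5, 3.5, 5),
           (3.5, 4.5, 1), (4.5, 5.5, 1), (5.5, 6.5, 1)]

print("crude mode, coarse grouping:", crude_mode(table_a))
print("crude mode, fine grouping:  ", crude_mode(table_b))
```

The same thirteen observations thus yield $3.50 under one grouping and $3.00 under the other.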

In general, statisticians do not like such instability of values, resulting
from an adventitious circumstance like the selected size of an interval.
Instability tends to discredit the very validity of any measure. To
circumvent this dilemma, two alternatives are available: (1) abandon that
average and use another, or (2) refine the crude mode by a method of
calculation which would reduce its instability. But with all its admitted
instability, the mode cannot be abandoned in favor of another average
if it is the mode that one wants. Hence, we must turn our attention to the
refinement of that measure.

The crudest method, discussed and illustrated above, designates the
midpoint of the interval of the most populous class; it ignores the
immediately adjacent intervals and their frequencies. But these adjacent
intervals would have affected the value of the mode if the class boundaries
had been placed otherwise. Hence, a more sensitive formula has been
designed to reflect the “pulling power” of these adjacent frequencies, and
to allow the operation of their force. Its application presumably produces
the result that would be obtained if class intervals were made progressively
smaller in order to secure a more stable and accurate approximation of the
point of greatest density.

The Difference Method. This method of refining the mode proceeds: (1) by
calculating the differences between the modal frequency and the
respective adjacent class frequencies; (2) by calculating the ratio of one
of these differences (usually the next lower) to the sum of the two
differences; (3) by applying this proportion to the modal class width; this
result, when (4) added to the true lower boundary of the modal interval,
serves to fix the value of the refined mode. The formula:

$$Mo = L + \frac{D_1}{D_1 + D_2}\, i \qquad (5.1.1)$$

where $Mo$ = refined mode
      $L$ = the true lower limit of the modal interval
      $D_1$ = the difference between the modal frequency and the
              frequency of the next lower interval
      $D_2$ = the difference between the modal frequency and the
              frequency of the next higher interval
      $i$ = the class interval


Applying this formula to Tables 5.1.1a and 5.1.1b we obtain the following
results:

(a) $$Mo = \$2.50 + \frac{1}{1 + 4} \times \$2.00 = \$2.50 + .40 = \$2.90$$

(b) $$Mo = \$2.50 + \frac{2}{2 + 4} \times \$1.00 = \$2.50 + .33 = \$2.83$$


The mode has now been “pulled” from $3.50 and $3.00 to $2.90 and $2.83
respectively, and thus displays a marked gain in stability. Where before
the modes differed by 50 cents, they now differ by only 7 cents. The
influence of grouping is reduced by allowing the adjacent frequencies to
participate in the result.
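
Formula 5.1.1 can be set down as a short routine and applied to both groupings. This is a sketch under stated assumptions: the function name and argument order are introduced here, and the frequency of 1 for the class next above the modal class in Table 5.1.1b is inferred from the printed result of .33 rather than read directly from the table.

```python
def refined_mode(lower_limit, width, f_modal, f_lower, f_higher):
    """Refined mode by the difference method (Formula 5.1.1).

    lower_limit : true lower limit of the modal interval (L)
    width       : class interval (i)
    f_modal     : frequency of the modal class
    f_lower     : frequency of the next lower class
    f_higher    : frequency of the next higher class
    """
    d1 = f_modal - f_lower
    d2 = f_modal - f_higher
    if d1 + d2 == 0:
        raise ValueError("the modal class does not exceed its neighbors")
    return lower_limit + d1 / (d1 + d2) * width


mode_a = refined_mode(2.50, 2.00, f_modal=6, f_lower=5, f_higher=2)  # Table 5.1.1a
mode_b = refined_mode(2.50, 1.00, f_modal=5, f_lower=3, f_higher=1)  # Table 5.1.1b

print(f"refined mode (a): ${mode_a:.2f}")
print(f"refined mode (b): ${mode_b:.2f}")
print(f"difference: {abs(mode_a - mode_b) * 100:.0f} cents")
```

Only the modal class and its two neighbors enter the calculation; the remaining classes have no influence on the refined value.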


Graphic Method. The graphic version of the difference method is
illustrated in Figures 5.1.3a and 5.1.3b. We first convert each tabulation into
a histogram. The procedure continues by connecting the true limits of
the columns adjacent to the modal column by diagonals in the manner



Figure 5.1.3a Difference Method for Determining Refined
Mode (Based on Table 5.1.1a)


Figure 5.1.4 Bimodal Frequency Distribution, Median School Years
Completed, 115 Census Tracts, Indianapolis, 1950
[Horizontal axis: median school years completed]


modes probably reflect prevalent terminal points in schooling:
a large number leave school immediately after attaining minimum
age; another large concentration take their leave after having
completed high school. When confronted with such [...], the [...] worker
is often called upon to disentangle the [...];
failing that, he would have to accept the dual mode [...]
of a single population.

Evaluation of the Mode. While the mode would [...]
for measuring representativeness (what is more representative than the
most frequent value?), some of its characteristics [...]
but the simplest purposes. First, it cannot subsequently [...]
by algebraic rules because of its own derivation, a point which will
become clearer when we [...] the mean; (2) it is influenced by the width






Figure 5.1.3b Difference Method for Determining Refined
Mode (Based on Table 5.1.1b)


shown. Then, from the point where these diagonals intersect, we drop a
perpendicular to the base line. This perpendicular cuts the base line at
the refined mode, which is simply read off the scale. Such a graphic
presentation shows more clearly than does the arithmetic calculation how
the discrepancy between the two differences determines the position of the
mode: the greater that discrepancy, the greater the displacement from
the crude mode (the midpoint of the modal interval). When there is no
discrepancy, there is of course no displacement at all.

Bimodality. Some distributions display two concentrations or humps, and are therefore called bimodal to distinguish them from unimodal distributions. This bimodality of a given distribution is generally the result of amalgamating two or more populations which have markedly different locations on the scale. Thus, the bimodality in the frequency polygon of adult heights is due to the consolidation of groups of males and females, who are characterized by two different sets of heights. A graphic example of bimodality is furnished by the frequency distribution of the average schooling for 115 Indianapolis census tracts (Figure 5.1.4). These two



§5.2 The Median


4. In the text, it was stated that it is impossible to determine the degree of modality from the mode. Does this statement still hold true when the mode is corrected according to the difference formula?

5. In computing the refined mode, what assumption is made concerning the distribution of the items in the modal class interval?

6. List two instances of bimodal frequency distributions of social data.

7. For what kind of data is the mode the only possible average?

8. Can the degree of modality be approximately inferred from the value of the refined mode? For example, if the true modal family size is 2.9, does this not indicate that the most frequently appearing family size is 2, but that it does not predominate much above the frequency of size 3?

9. Compute the refined mode for the distribution of suicide rates (Table 3.1.1d) according to Formula 5.1.1.

10. Compute the refined mode of the distribution of delinquency rates (Problem 11, p. 47) by Formula 5.1.1.


Section Two 

The Median, or Position Average

The Principle of Ordinal Position. In any given array, every item holds a certain rank, be that the first, second, tenth, or seventy-sixth. Obviously, any particular numerical rank takes its meaning and significance from the total number of ranks there are. A rank of 10 in a series of 100 is relatively higher than a rank of 10 in a group of 20.

The point which cuts the array into two equal divisions, so that exactly one-half of the items are below, and one-half are above that point, is called the median. The "average" student, for example, usually considers himself near the middle of the array with about 50 per cent of the students below and above. Although, according to this method of reckoning, the average student would also display the most frequent score (i.e., the mode), the focus here is on the position in the rank order, rather than on his membership in the modal group. Therefore, the student who stands exactly in the middle of the lot, with a grade of 80, for instance, is thought of as the median student, and his grade or score is the median value, or simply the median. In 1950, the median age of the total United States population was 30.4 years, signifying that one half of the population was older, and one half younger than that age. Since the median clearly denotes position in a sequence of values, it is often referred to as a position average.




of the class interval employed, and therefore to that extent lacks stability; and (3) it is unable to show its degree of modality and is in that sense non-specific. We may legitimately raise a question about the serviceability of a measure which tells you the most frequent value, but not its relative weight in the distribution, i.e., the degree of modality. It gives you the most probable value, but offers no clue to how probable that value is. If this is required, the whole distribution must again be examined. Finally, as to how large the predominant frequency should be in order to dignify it as a modal frequency, there is no rule but good practical sense.

Any criticism that might be levied against the mode is not to deny that this measure is frequently quite useful. A housing administrator, planning a building program, will be interested in the predominant size of the families to be domiciled. He will not employ the mean size of all families, nor the median, but will estimate the most frequent size which is likely to occur among his renters. A young law graduate, in embarking on his career, will not estimate his probable income on the prevailing mean computed from all known incomes, but rather on the predominant income for his class. Clothing merchants will stock their stores according to their expectations of modal patronage. The fact that the degree of modality may not be readily available does not alter the need for information on the predominant frequencies of the items of interest. The mode is, furthermore, the only average which is applicable to qualitative variables.

Although the technical terminology may not be employed by the hous- 
ing administrator, the aspiring lawyer, or the clothing merchant, his con- 
ceptual focus is still on the mode, which he informally and vaguely labels 
the "average." 


Questions and Problems 

1. Define the following concepts: 

Measure of Location 
Probability Average 
Modal Frequency 
Crude Mode
Refined Mode
Degree of Modality
Difference Method
Bimodality

2. Under what circumstance does the refined mode equal the crude mode?

3. For a given N, what would be the effect on degree of modality of the following circumstances?

(a) a wider range of data with more concentration in the tails

(b) a narrower range, with less concentration in the tails

number of items below and above the median would now be three. Counting off three items from either end, we would meet halfway between ranks 3 and 4, or at 3.5, which is the true upper limit of the third and the true lower limit of the fourth rank. The corresponding point in the array of values is a point midway between 5 and 7. Since this interval extends from 4.5 to 7.5, the median point is exactly 6.00. Actually, any value between the midpoints 5 and 7 would satisfy the definition of the median, since the median is a point such that half of the ranked items are above and half below it. It is conventional, however, to locate the median exactly halfway between the values, a placement which is consistent with the even division of ranks.*
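The odd and even cases for ungrouped data can be checked with a short Python sketch. The function name `median_ungrouped` is our own label; the data are the dollar amounts used in the text, with and without the added item of $13.

```python
def median_ungrouped(values):
    """Median of an ungrouped array: the point with half the ranks on each side."""
    ordered = sorted(values)          # step 1: arrange in rank order
    n = len(ordered)
    if n == 0:
        raise ValueError("no items")
    middle = n // 2
    if n % 2 == 1:
        # Odd N: the half-way point bisects the middle rank.
        return float(ordered[middle])
    # Even N: by convention, halfway between the two central values.
    return (ordered[middle - 1] + ordered[middle]) / 2


print(median_ungrouped([5, 2, 7, 4, 10]))        # five persons
print(median_ungrouped([5, 2, 7, 4, 10, 13]))    # six persons
```

For the even case the sketch adopts the convention stated above, although any point between the two central values would satisfy the definition.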


Procedure in Grouped Data. Most statistical problems are not as simple as those cited above; in fact, such simple problems are quite unrealistic, for statistical investigations generally involve large masses of data. Although in such cases the procedure for computing the median is fundamentally the same as that for ungrouped data, it must necessarily accommodate itself to the fact that the median will almost always lie somewhere within a class interval. This median interval itself can be readily identified from the cumulative frequency table, but it is still necessary to establish the median point within that interval.

The cumulative frequency distribution of Table 5.2.2a will illustrate the method of determining the median age of the population of the United States in 1950. (Observe that the frequencies have been rounded to the nearest million, which is a type of coding that will not significantly influence the median and will save much tedious work.)

Conforming to the above directive, we first divide the total frequency into halves: N/2 = 150/2 = 75. We then locate the class interval of the N/2th, or the 75th, case by counting from either end. Cumulating from the lower end, we find that the 75th case is obviously not among the first 16 cases, nor among the first 62 cases, which represents the cumulation of the first three frequencies. The fourth cumulation of 86 overshoots the mark by a

* In some texts, the median of ungrouped data is located according to the formula (N + 1)/2, which may be called the method of [...]. This would yield for Table 5.2.1a the median rank of 3, and for Table 5.2.1b the median rank of 3.5; the corresponding values would be 5 and 6 respectively. Although the results agree with those above, the method is not recommended. It has led to the misconception that (N + 1)/2 items are below (and above) the median, which is contrary to the fundamental [...]. Moreover, this formula is inapplicable to grouped data, and there is no compelling reason for separate formulas for grouped and ungrouped data, which differ only in tabular appearance. We therefore adhere uniformly to the N/2 formula.




Calculation of the Median. Since the median divides the range into two parts, with each division containing exactly 50 per cent of the items, it is nothing more than the common true boundary between those two segments. There are therefore three essential steps in the determination of the median: (1) arranging the values in rank order; (2) finding the position of the N/2th item; (3) determining the value of that item.

With ungrouped data, involving only a few cases, the median could be found by inspecting the array. Let us suppose that five persons have respectively 5, 2, 7, 4, and 10 dollars each. In order to find the median, we arrange the values in order of magnitude, and designate the position, or rank, of each, as shown in Table 5.2.1a. According to the basic definition, the number of items above and below the median is in this case 2½ items. Hence, starting at either end, we count through the N/2th, or 2½th, rank, which places the halfway point exactly in the middle of the third rank; it bisects the third rank.* The corresponding value, or the median, would be the midpoint of 5, or exactly 5.00.

Table 5.2.1a  Computation of Median, Odd Number of Items

 X     Rank
 2      1
 4      2       2½ ranks below
 5      3       Median point (bisects the third rank)
 7      4
10      5       2½ ranks above

When the array contains an even number of items, a short additional step is required. If, for example, an item of $13 were added to the array of Table 5.2.1a, we would have the situation shown in Table 5.2.1b. The


Table 5.2.1b  Computation of Median, Even Number of Items

 X     Rank
 2      1
 4      2
 5      3       3 ranks below
 ------------   Median Point
 7      4       3 ranks above
10      5
13      6

* We should remind the reader that it is quite conventional to treat discrete data, such as ranks, as if they were continuous.


Nor would the result differ if we operated on percentage frequencies, instead of absolute frequencies. Since N/2 is necessarily 50 when the calculation is performed on percentages, the formula becomes:

$$Md = L + \left(\frac{50 - cf}{f} \times i\right)$$

it being understood that f and cf stand for the appropriate percentage frequencies (see Table 5.2.2b). We note finally that, as in the above example, the frequency table need not be closed in order to obtain the median.

Table 5.2.2b  Computation of Median Age, U.S. Population, 1950

Age              Per Cent    Cumulated Percentages
Under 5            10.7            10.7
5-14               16.1            26.8
15-24              14.8            41.6
25-34              15.8            57.4
35-44              14.2
45-54              11.5
55-64               8.8
65 and over         8.1
Total             100.0%

$$Md = 25.0 + \left(\frac{50 - 41.6}{15.8} \times 10\right) = 25.0 + 5.3 = 30.3 \text{ years}$$


Median of Discrete Data. Some writers would limit the use of the median to continuous data, for the reason that discrete data cannot be fractionated as would be required for the median. But such a [...] does not seem too feasible. In the discrete tabulation of [...] families by size (Table 5.2.3), there is no observed family size such that exactly 50 per cent of the families are larger, and 50 per cent smaller. We find that 34 per cent of all families consist of 2 persons or less, and 56 per cent of 3 persons or less. Hence, some of the three-person families lie below the 50 per cent point, and the remainder lie above it, [...] which seems to exclude the possibility of a clean median [...]. In this situation, must we then abandon the median, or [...] treat the data as continuous and proceed to accept [...]



Table 5.2.2a  Population by Age, Cumulative Distribution, U.S., 1950

Source: Bureau of the Census, Census of Population: 1950, Vol. II, Characteristics of the Population, Part 1, U.S. Summary, Table 39. U.S. Government Printing Office, Washington, D.C., 1953.

substantial margin. The 75th case is therefore somewhere within the class frequency of 24, which is thought of as being spread uniformly throughout the interval, 25-34. Having accounted for 62 cases, we need to pass through 13 additional items (75 minus 62) of the 24 in order to include the desired item, and to establish the position of the median. Starting at its true lower limit, we must, therefore, penetrate the fourth class interval 13/24ths, or 54 per cent, of its width. Since the class interval is 10, the distance still to go to reach the median is .54 × 10, which is 5.4 years. Adding this amount to the true lower limit of the class interval (25.0), the median is found to be 30.4 years.

The entire procedure may be compactly written as follows:

$$Md = L + \left(\frac{\frac{N}{2} - cf}{f} \times i\right) \qquad (5.2.1)$$

where Md = median
L = the true lower limit of the class interval in which the median is located
N/2 = one half of the total frequency
cf = the cumulated frequency up to the median class interval
f = the frequency of the median class interval
i = class width

Substituting in this formula, we arrive at exactly the same result as before:

$$Md = 25.0 + \left(\frac{75 - 62}{24} \times 10\right) = 25.0 + 5.4 = 30.4$$

Age              f (in millions)     cf
Under 5                16             16
5-14                   24             40
15-24                  22             62
25-34                  24             86
35-44                  22            108
45-54                  17            125
55-64                  13            138
65 and over            12            150
Total                 150




even hundredth. We may state that Mr. Jones is in the upper half of the income distribution, but to place him in the highest 1 per cent is certainly more informative, although both statements may be true. For such added discrimination, smaller subdivisions are required. The computation of quartiles, deciles, and centiles, which divide the array into fourths, tenths, and hundredths, respectively, is carried out according to the same principle as in the case of the median, except that the appropriate frequency, or percentage, of items below the point in question is substituted for N/2 in the formula given above. For example, to find the point below which the lowest quarter of the cases fall, we replace N/2 by N/4. Thus, the first quartile in the age distribution shown in Table 5.2.2a is:

$$Q_1 = L + \left(\frac{\frac{N}{4} - cf}{f} \times i\right) = 5.0 + \left(\frac{37.5 - 16}{24} \times 10\right) = 5.0 + 9.0 = 14 \text{ years}$$

If we wish to locate the point below which 75 per cent of the items fall (or Q_3), we make the following substitution in the formula:

$$Q_3 = L + \left(\frac{\frac{3N}{4} - cf}{f} \times i\right)$$

The 90th centile (or C_90) would be found by the formula:

$$C_{90} = L + \left(\frac{\frac{90N}{100} - cf}{f} \times i\right)$$
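Since quartiles, deciles, and centiles differ from the median only in the fraction of N that is counted off, one function can serve for all of them. The following Python sketch is illustrative only; `grouped_quantile` and `AGE_1950` are names of our own, the frequencies are those of Table 5.2.2a, and a true lower limit of 0 is assumed for the class "Under 5."

```python
def grouped_quantile(classes, proportion):
    """Point below which `proportion` of the cases fall, by interpolation.

    classes: (true_lower_limit, width, frequency) tuples in ascending order;
    width may be None for an open-ended class.
    """
    total = sum(f for _, _, f in classes)
    target = proportion * total       # takes the place of N/2 in the formula
    cf = 0                            # frequency cumulated below the class
    for lower, width, f in classes:
        if f > 0 and cf + f >= target:
            if width is None:
                raise ValueError("quantile falls in an open-ended class")
            return lower + (target - cf) / f * width
        cf += f
    raise ValueError("no cases to locate")


# Table 5.2.2a: U.S. population by age, 1950, frequencies in millions.
AGE_1950 = [
    (0, 5, 16), (5, 10, 24), (15, 10, 22), (25, 10, 24),
    (35, 10, 22), (45, 10, 17), (55, 10, 13), (65, None, 12),
]

for label, p in [("Q1", 0.25), ("Md", 0.50), ("Q3", 0.75), ("C90", 0.90)]:
    print(label, round(grouped_quantile(AGE_1950, p), 1))
```

The first quartile comes to roughly 14 years, in agreement with the hand calculation above.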




Quantiles as Standardized Measures. The [...] of [...] deciles, and centiles, which by their definition indicate [...] items that are located below or above a [...], are referred to as quantiles. As has been demonstrated, [...] may be used to fix the relative position of any given [...] array. A weight of 180 pounds may be located at the 90th centile (C_90 = 180 lb.), which places that weight in a position such that 10 per cent of the population are above that weight, and 90 per cent below. [...] an age [...], an I.Q. of [...], a height of [...], [...] a suicide rate of 22 per 100,000: all may conceivably be located at the same abstract centile point.

It is now evident that quantiles must be recognized as standardized





Table 5.2.3  Computation, Median Family Size, Indiana, 1950

Size of Family    Number of Families    Per Cent    Cumulated Per Cent
      1                 64,0?5             7.6             7.6
      2                222,?47            26.4            34.0
      3                185,540            22.0            56.0
      4                146,519            17.5
      5                 95,757            11.4
      6                 57,514             6.9
      7                 32,916             3.9
      8                 1?,500             2.1
      9                 10,585             1.?
     10                  5,519              .6
     11                  2,469
     12                  1,555              .2
   Total               842,000           100.0%

$$Md = 2.5 + \left(\frac{50 - 34.0}{22.0} \times 1\right) = 2.5 + .73 = 3.23 \text{ persons}$$



value as the median? The simple answer is that we follow the latter alternative, as we do with every other average. We do this to make finer distinctions than would be possible with whole numbers.

If, for example, the 48 states in 1950 were to be ranked by median size of family, it is obvious that fractional values would have to be employed. Our only option is to treat the data as continuous in order to satisfy the requirements of the problem which has been set up. Operating on this practical principle, we find the median of the above tabulation to be 3.23, a result which presumably would serve to establish Indiana's exact rank order in American family fertility.

The median is a simple concept: 50 per cent of the items are smaller in value, and 50 per cent are larger. Furthermore, by at least one criterion, it is the most representative of the averages: the aggregate distance between the median and each of the values is less than from any other point. It is therefore nearer to its companion values than any other average. It is in this sense that the median occupies the most central position in a distribution.
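The claim about aggregate distance can be verified numerically. The sketch below is an illustration of our own: it reuses the five dollar amounts of Table 5.2.1a and compares the sum of absolute distances from the median with the sums from the mean and from a grid of other trial points.

```python
def aggregate_distance(values, point):
    """Sum of the absolute distances between `point` and each value."""
    return sum(abs(x - point) for x in values)


dollars = [2, 4, 5, 7, 10]
median = sorted(dollars)[len(dollars) // 2]   # middle item of an odd array
mean = sum(dollars) / len(dollars)

print("from the median:", aggregate_distance(dollars, median))
print("from the mean:  ", round(aggregate_distance(dollars, mean), 1))

# No trial point between 0 and 15 (in steps of 0.1) does better than the median.
trial_points = [p / 10 for p in range(0, 151)]
best = min(trial_points, key=lambda p: aggregate_distance(dollars, p))
print("best trial point:", best)
```

The distances are taken without regard to sign; if signs were kept, it is the mean, not the median, around which the deviations would balance.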

Other Position Measures. Instead of reporting a student in the upper half of his class, which requires the median, we may wish for greater precision to locate him in a smaller interval, such as the highest quarter, tenth, or





§5.3 The Mean


Table 5.2.4  Yearly Family Income, White and Non-White, Urban U.S., 1955

Income                  Per Cent
Under $500                 3.4
$500-999                   4.3
1,000-1,499                4.9
1,500-1,999                4.9
2,000-2,499                5.5
2,500-2,999                5.5
3,000-3,499                7.4
3,500-3,999                7.2
4,000-4,499                8.2
4,500-4,999                7.3
5,000-5,999               12.8
6,000-6,999                9.5
7,000-9,999               12.9
10,000-14,999              4.3
15,000-24,999              0.9
25,000 and over            0.5
Per Cent                 100.0%
















measures of position, independent of the metric system used or of the substantive type of the data. Thus, a person who is at the 90th centile in intelligence may also be around the 90th centile in income, thus suggesting an affinity between these two social phenomena. By such a transformation, non-comparable measures may be usefully juxtaposed.

As will be more clearly seen in a later section, these measures are also independent of the pattern of distribution, whether normal, skewed or rectangular. This circumstance enhances the versatility of quantile measures, and enables them to represent any variate as a rank position in a set of data, by means of the statistical procedures here set forth.

Questions and Problems 

1. Define the following concepts:

Median
Position Average
Rank Order
Median Rank
Median Point
Median Class Interval
Decile

2. Is it possible to compute the median of a combined set of values when only the medians and total frequencies of each group are known? When total frequencies are identical?

3. Explain why the median can be calculated, even when the frequency table is open-ended.

4. In computing the median of grouped data, what assumption is made concerning the distribution of items within the median interval? Compare this with the assumption of the refined mode.

5. Is it necessary to compute the median for grouped data when one of the cumulated frequencies is equal to N/2? Verify your answer with a simple illustrative calculation.

6. (a) Compute the median of all family incomes (Table 5.2.4); of all white and non-white incomes, respectively, by Formula 5.2.1.

(b) Could you have obtained the median of all incomes if only the distribution of incomes below $3,000 were given?

(c) Compute quartiles for both white and non-white distributions and compare.

(d) Would any of the above measures be affected if absolute frequencies were used instead of percentages?

7. Compute the median of the distribution of suicide rates (Table 3.1.1d), by Formula 5.2.1. Compare with the mode of the same distribution and interpret the difference.

8. Compute the median of the distribution of delinquency rates (Problem 11, p. 47). Compare with the mode.




are weighted by their respective frequencies (f), summed, and then divided by N. The deviations (x) must likewise be weighted by class frequencies (f), and will then sum to zero.

The mean, as that point around which the positive and negative deviations exactly balance, may be illustrated graphically by a sketch of fulcrum, lever, and weights. In conformity to childhood experience on the seesaw, the aggregate force of the children on either side of the fulcrum is determined not only by their number, but also by their respective distances from the fulcrum. In the first sketch, the sum of the two weights (-5 and -2) below the fulcrum balances the two weights (+3 and +4) above the mean. In the second drawing, two weights of -3 equal the three weights of +2, as any child on a seesaw would intuitively appreciate.

Figure 5.3.1a Diagram of Mean as Center of Gravity, Ungrouped Data (Table 5.3.1a)



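$$\bar{X} = \frac{\Sigma X}{N} \qquad (5.3.1)$$

where $\bar{X}$ = the mean (read: "X-bar")
$\Sigma X$ = the sum of the variates (read: "summation X")
X = a variate, or value
N = total frequency, or number of items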
N = total frequency, or number of items 

This is a very ancient conception of the average, and is consistent with the etymological derivation of the term. The Latin havaria once referred to the insured cargo loss which was equally divided among the participating shippers. This actuarial principle of spreading the risks survives, of course, as the basic procedure in mutual insurance practice. Although this arithmetic conception of the mean is quite adequate for most workaday purposes, it does not exhaust all of its statistical implications.

Another approach to the definition of the mean, necessarily congruent with the foregoing, views the values in the distribution as deviations from a central norm which is considered the true value. The scatter of shots around the bull's-eye of a target and the bell-shaped distributions of replicated laboratory measurements illustrate this conception. The deviations on both sides of the central value constitute departures, or errors, from that norm. From this point of view, the mean is considered the true value, free of errors. (See pp. 21-22.)

In line with this trend of thought, deviations often tend to be discounted as being random, temporary, exceptional, or less valid than the central value. The despondent person who has encountered a "streak of bad luck" confidently expects "things to average out." He feels that something akin to a law of nature provides that bad luck in the long run must be balanced by good luck. Even a man who has enjoyed a run of good luck does not dare to "push his luck too far." The average monthly income of a salesman, the average of irregularly spaced rainfall, the batting average which is flanked by slumps and sprees, the "picnic finance" where expenses are equally divided: all these are firmly rooted in daily experience and in popular parlance. Statistically speaking, the true value emerges when the deviations cancel out.

The question is, therefore: what value would we have if the errors were eliminated? The answer is: that central value around which the errors exactly balance one another, so that the positive and negative errors, when added together algebraically, equal zero. If all deviations were eliminated, it must follow that all values would be identical. The mean may therefore be defined as that value in a given aggregate which would occur if all the values were equal. Hence, the mean taken N times would equal the sum of the observed values: $N\bar{X} = \Sigma X$.

The basic principle of the balanced deviations is computationally illus- 
trated in Table 5.3.1. 
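The principle of balanced deviations may also be checked by a short calculation. The Python sketch below is our own illustration and does not reproduce Table 5.3.1; it uses the five dollar amounts of the median example and exact fractions, so that the sum of the deviations comes out to zero without rounding error.

```python
from fractions import Fraction

values = [5, 2, 7, 4, 10]
n = len(values)

mean = Fraction(sum(values), n)             # X-bar = (sum of X) / N
deviations = [x - mean for x in values]     # x = X - X-bar

print("mean:", float(mean))
print("deviations:", [float(d) for d in deviations])
print("sum of deviations:", sum(deviations))        # balances at zero
print("N times the mean:", n * mean, "= sum of X:", sum(values))
```

Any other trial value substituted for the mean would leave a non-zero remainder, which is the basis of the correction used later in the method of the guessed mean.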

With ungrouped data, the mean is computed by summing the single 
values and dividing by N. In grouped data, however, the variates (X) 





In Table 5.3.2a, each variate (size of family) is considered to be at the midpoint of an interval of one (e.g., .5-1.5, 1.5-2.5, and so on). These variates are readily weighted by the frequencies since they are already tabulated as midpoints instead of class intervals. When, however, the class interval is larger than one, and the class limits are set down, the midpoint must first be calculated. Each midpoint is then weighted as previously, and the mean computed (Table 5.3.2b).


Table 5.3.2b  Computation of Mean Family Size (Table 5.3.2a Regrouped)

Family Size    Midpoint X      f       fX
   1-2            1.5         44      66
   3-4            3.5         32     112
   5-6            5.5         14      77
   7-8            7.5          7      52.5
   9-10           9.5          3      28.5
                          N = 100   336.0

$$\bar{X} = \frac{\Sigma fX}{N} = \frac{336}{100} = 3.36$$


In allowing the midpoint to represent all the values in a given class interval, we resort to the assumption that all the items in the interval have the value of the midpoint. Even though this is not true, the assumption is not too violent, and is in keeping with the good statistical practice of accepting approximation as a small price for expediting calculations.

The mean of the regrouped distribution (3.36) differs slightly from that of the previous calculation (3.35), a discrepancy which is merely a result of grouping error.
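The weighting of midpoints by class frequencies can be sketched as follows. The function name `grouped_mean` is ours; the class limits and frequencies are those of Table 5.3.2b, and each midpoint is taken as the average of the two stated class limits.

```python
def grouped_mean(classes):
    """Mean of grouped data: sum of (midpoint * f) divided by N.

    classes: (lower_limit, upper_limit, frequency) tuples.
    """
    total_f = sum(f for _, _, f in classes)
    if total_f == 0:
        raise ValueError("no cases")
    # Every item in a class is treated as if it stood at the midpoint.
    weighted_sum = sum((lower + upper) / 2 * f for lower, upper, f in classes)
    return weighted_sum / total_f


# Table 5.3.2b: one hundred families regrouped into intervals of two.
FAMILY_SIZE = [(1, 2, 44), (3, 4, 32), (5, 6, 14), (7, 8, 7), (9, 10, 3)]

print(round(grouped_mean(FAMILY_SIZE), 2))
```

The small difference from 3.35 arises solely because items are moved to the midpoints of wider classes.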

The Concept of Coding. Since, in actual practice, frequencies as well as magnitudes are likely to be large, the foregoing methods will usually become quite cumbersome. Hence, certain arithmetic tactics have been devised to simplify the pencil-and-paper computations and to reduce the possibility of errors. One type of operation has come to be known as coding.

Coding is a generic term which may denote: (1) the substitution of a convenient simplified symbol for a set of complex values, or (2) the reduction of a mass of data into simple sets of categories. Consequently, the general concept has wide applicability. The first type is exemplified by the code employed in the preparation of data for electronic sorters and






Figure 5.3.1b Diagram of Mean as Center of Gravity, Grouped Data (Table 5.3.1b)

Calculation: Grouped Data. As the student now knows, grouped data do not differ from simple ungrouped data except in the fact that similar values are grouped together, and the frequency, or weight, of the grouped items is indicated for each group in the "f" column. Therefore, to calculate the mean of grouped items, we obtain the sum of the weighted values, and divide by N.



Table 5.3.2a  Computation of Mean Family Size, One Hundred Families

To determine, for example, the mean number of persons per family, we must first determine the total number of persons ($\Sigma fX$), and then divide by the number (N) of families. Since there are 20 families of one person each, the weighted total in this class will be 20; 24 families of 2 persons each give a total of 48 persons. Proceeding as in Table 5.3.2a, we find the mean to be 3.35 persons per family.

Although this average, based as it is on discrete data, cannot correspond to any observed size, it is nevertheless a useful measure of magnitude, which, as with the median, no one finds disturbing.



Table 5.3.4  Computation of Mean, Coding by Subtraction

Original Values      Coded Values
   5,000,003               3
   5,000,007               7
   5,000,008               8
  15,000,018              18

$\bar{X}$ = 5,000,006        $\bar{X}$ = 6

$\bar{X}$ = 5,000,000 + 6 = 5,000,006


remainders; and finally (3) to bring forth the 5,000,000 and restore it to the coded mean. We thereby obtain the mean of the original values. Any other conceivable coding constant could have been subtracted, and the final result would have been the same. But if the objective is maximum simplicity, the only practical choice would be 5,000,000 as our coding constant.
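The three steps of coding by subtraction can be followed in a short Python sketch. The function name `mean_by_subtraction` is our own; the three values are those of Table 5.3.4, and a second, less convenient constant is tried to show that the final result does not depend on the constant chosen.

```python
def mean_by_subtraction(values, constant):
    """Mean computed on coded values (X - constant), then decoded."""
    coded = [x - constant for x in values]     # (1) subtract the constant
    coded_mean = sum(coded) / len(coded)       # (2) mean of the remainders
    return constant + coded_mean               # (3) restore the constant


values = [5_000_003, 5_000_007, 5_000_008]

print(mean_by_subtraction(values, 5_000_000))   # the convenient constant
print(mean_by_subtraction(values, 4_999_990))   # any other constant also works
print(sum(values) / len(values))                # direct computation
```

The choice of 5,000,000 matters only for the ease of the hand arithmetic, not for the answer.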

Method of the Guessed Mean. The foregoing procedure reduces itself to the method of the guessed mean.* By inspection of the oversimplified example quoted above, it is clear that the mean would be close to 5,000,000. We could have accepted this figure as a tentative mean, and viewed the coded variates as deviations from the preliminary guess. We would then "correct" the guessed mean by an amount equal to the mean of the deviations from the guessed mean. The full procedure would be carried out according to the following formula:

$$\bar{X} = X' + C \qquad (5.3.2)$$

where $\bar{X}$ = the mean
X' = the guessed mean
X - X' = deviation from the guessed mean
C = $\frac{\Sigma (X - X')}{N}$, or the correction factor
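Formula 5.3.2 may be sketched in Python as follows. The function name `guessed_mean_method` and the five values are hypothetical; the values were chosen by us so that a guess of 5 requires a correction of 1.4, and they are not the data of Table 5.3.5.

```python
def guessed_mean_method(values, guess):
    """Mean by the guessed-mean method: X' + sum(X - X') / N."""
    deviations = [x - guess for x in values]        # X - X'
    correction = sum(deviations) / len(values)      # C, the correction factor
    return guess + correction


# Hypothetical data (not Table 5.3.5).
values = [3, 5, 6, 8, 10]

print(guessed_mean_method(values, 5))    # guess too low: correction is positive
print(guessed_mean_method(values, 7))    # guess too high: correction is negative
print(sum(values) / len(values))         # direct computation for comparison
```

Whatever the guess, the correction brings the result back to the same mean; a correct guess would simply yield a correction of zero.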

For the data in Table 5.3.5, we set the guessed mean equal to 5, and compute the deviations about this provisional mean. It could hardly ever be expected that the guessed mean would turn out to be the true mean. But, from the definition of the mean, we know that if the choice had been correct, the sum of the deviations would have been zero, and the problem would be solved. Since the mean of the deviations exceeds zero by 1.4, we must raise the guessed mean by that amount, which correspondingly depresses

* This method of the guessed mean is also known by other labels, consistent with this designation: assumed mean, provisional mean, and arbitrary origin.


135 




Averages 


computers, or the Morse code in telegraphy. The second type is Ulustrated 
by the arithmetic simplification of bulky f^res with which we are here 
concerned. However, unlike the codes which a military enemy is deter- 
mined to decipher or “break,” complexity in a statistical code is not coa- 
odered a protection or a virtue, but rather a defect. 

The type of coding here under conaderation rests on the elementary 
principle that uniform arithmetic treatment of a series of values leavts 
corresponding relations unaltered: uibtractioc or addition of constants 
leaves intervals between values undtanged; multiplication or division by 
like factors leaves ratios between values within the series unaltered. This 
is demonstrated in the series shown in Table S.2.3: the subtraction of the 


Table 5.3.3 
Sub(rae(tn;r a Consianl 


X 

0 

X-50 

100] 

100 

r 

200] 

100 

(150 

300 

200 

250 

500 

600 

450 

1,000 


050 


constant 50 from each value does not alter differences between values. 
Analogously, the division of each value by 50 does not alter ratios among 
values. Thus, 200:100 r:45, and 1,000;100 ;:20;2. 

In statistics such coding is designed only as a temporary convenience. 
Consequently, after the coded computations have been completed, the 
results must be decoded: the constant factor used in division must be 
reintroduced by multiplication; and, the factor subtracted must be added 
in order to restore the original quantities, and thereby produce the results 
which would have been obtained without encoding in the first place. By 
adding 50 where we had previously subtracted it, or by multiplying by 50 
where we had divided by it, it is obvious that we have merely re-established 
the original values as though notlung had happened. 

Coding Applied Co Che 3ffan. Let us suppose that we are required to find 
the mean of the three large numbers shown in Table 5.3.4. The given 
values sum to 15,000,018 and have a mean of 6,000,006. Now it will bo 
evident that it was wasteful to carry along the heavy freight of 5,000,000 
It would have been much simpler: ( 1 ) to subtract the 5,000,000 from each 
value, to sequester it, as it were; (2) to find the mean of the manageable 
134 




§ 5.3 The Mean


Table 5.3.4
Computation of Mean, Coding by Subtraction

    Original Values      Coded Values
        5,000,003              3
        5,000,007              7
        5,000,008              8
       ----------             --
       15,000,018             18

    $\bar{X} = 5,000,006$      $\bar{X} = 6$

    $\bar{X} = 5,000,000 + 6 = 5,000,006$


remainders; and finally (3) to bring forth the 5,000,000 and restore it to
the coded mean. We thereby obtain the mean of the original values. Any
other conceivable coding constant could have been subtracted, and the
final result would have been the same. But if the objective is maximum
simplicity, the only practical choice would be 5,000,000 as our coding
constant.
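
As an illustrative aside, the three steps of coding by subtraction can be sketched in Python. This is a minimal sketch rather than anything prescribed by the text; the function name `mean_by_subtraction` is hypothetical, and the second coding constant (4,999,990) is chosen arbitrarily to show that any constant yields the same result.

```python
def mean_by_subtraction(values, constant):
    """Mean computed by coding: subtract a constant, average, restore it."""
    coded = [v - constant for v in values]      # step 1: sequester the constant
    coded_mean = sum(coded) / len(coded)        # step 2: mean of the remainders
    return constant + coded_mean                # step 3: restore the constant


values = [5_000_003, 5_000_007, 5_000_008]      # the three values of Table 5.3.4

convenient = mean_by_subtraction(values, 5_000_000)   # coded values 3, 7, 8
arbitrary = mean_by_subtraction(values, 4_999_990)    # coded values 13, 17, 18
direct = sum(values) / len(values)                    # no coding at all

print(convenient, arbitrary, direct)
```

All three computations agree; the choice of 5,000,000 merely makes the coded values as small as possible.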

Method of the Guessed Mean. The foregoing procedure reduces itself to
the method of the guessed mean.* By inspection of the oversimplified
example quoted above, it is clear that the mean would be close to 5,000,000.
We could have accepted this figure as a tentative mean, and viewed the
coded variates as deviations from the preliminary guess. We would then
"correct" the guessed mean by an amount equal to the mean of the
deviations from the guessed mean. The full procedure would be carried out
according to the following formula:

        $\bar{X} = X' + C$                                        (5.3.2)

where   $\bar{X}$ = the mean
        $X'$ = the guessed mean
        $X - X'$ = deviation from the guessed mean
        $C = \frac{\Sigma (X - X')}{N}$, or the correction factor

For the data in Table 5.3.5, we set the guessed mean equal to 5, and
compute the deviations about this provisional mean. It could hardly ever be
expected that the guessed mean would turn out to be the true mean. But,
from the definition of the mean, we know that if the choice had been correct,
the sum of the deviations would have been zero, and the problem would
be solved. Since the mean of the deviations exceeds zero by 1.4, we must
raise the guessed mean by that amount, which correspondingly depresses

* This method of the guessed mean is also known by other labels, consistent with this
designation: assumed mean, provisional mean, and arbitrary origin.






Averages 


computers, or the Morse code in telegraphy. The second type is illustrated
by the arithmetic simplification of bulky figures with which we are here
concerned. However, unlike the codes which a military enemy is determined
to decipher or "break," complexity in a statistical code is not considered
a protection or a virtue, but rather a defect.

The type of coding here under consideration rests on the elementary
principle that uniform arithmetic treatment of a series of values leaves
corresponding relations unaltered: subtraction or addition of constants
leaves intervals between values unchanged; multiplication or division by
like factors leaves ratios between values within the series unaltered. This
is demonstrated in the series shown in Table 5.3.3: the subtraction of the


Table 5.3.3
Subtracting a Constant

        X          D        X - 50
       100                     50
                  100
       200                    150
                  100
       300                    250
                  200
       500                    450
                  500
     1,000                    950


constant 50 from each value does not alter differences between values.
Analogously, the division of each value by 50 does not alter ratios among
values. Thus, 200:100 :: 4:2, and 1,000:100 :: 20:2.

In statistics such coding is designed only as a temporary convenience.
Consequently, after the coded computations have been completed, the
results must be decoded: the constant factor used in division must be
reintroduced by multiplication; and the factor subtracted must be added
in order to restore the original quantities, and thereby produce the results
which would have been obtained without encoding in the first place. By
adding 50 where we had previously subtracted it, or by multiplying by 50
where we had divided by it, it is obvious that we have merely re-established
the original values as though nothing had happened.
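
As an illustrative aside, the coding principle can be checked mechanically. The following Python sketch uses a series like that of Table 5.3.3; the helper names `intervals` and `ratios_to_first` are hypothetical and were chosen only for this example.

```python
def intervals(values):
    """Differences between successive values of a series."""
    return [b - a for a, b in zip(values, values[1:])]


def ratios_to_first(values):
    """Ratio of every value to the first value of the series."""
    return [v / values[0] for v in values]


x = [100, 200, 300, 500, 1000]

coded_by_subtraction = [v - 50 for v in x]   # subtract the constant 50
coded_by_division = [v / 50 for v in x]      # divide by the constant 50

# Subtraction leaves the intervals between values unchanged.
same_intervals = intervals(coded_by_subtraction) == intervals(x)

# Division leaves the ratios among values unchanged.
same_ratios = ratios_to_first(coded_by_division) == ratios_to_first(x)

# Decoding reverses each operation and restores the original values.
decoded_from_subtraction = [v + 50 for v in coded_by_subtraction]
decoded_from_division = [v * 50 for v in coded_by_division]

print(same_intervals, same_ratios)
```

Note that subtraction does alter ratios, and division does alter intervals; each operation preserves only its own kind of relation.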

Coding Applied to the Mean. Let us suppose that we are required to find
the mean of the three large numbers shown in Table 5.3.4. The given
values sum to 15,000,018 and have a mean of 5,000,006. Now it will be
evident that it was wasteful to carry along the heavy freight of 5,000,000.
It would have been much simpler: (1) to subtract the 5,000,000 from each
value, to sequester it, as it were; (2) to find the mean of the manageable


rates. (1) The midpoint of the interval within which the true mean probably
lies is selected as the guessed mean. This is usually, but not necessarily,
the class interval of the largest frequency. We choose 13. (2) The
guessed mean is then subtracted from each midpoint (Column 4); that is,
it is used as a subtraction code. The deviations resulting from this
operation (3, 6, 9, and so on) are all necessarily multiples of the class
width, 3. This suggests (3) coding by division, using the interval as
the divisor. The results of this operation are designated x' (Column 5).
(4) We now weight the twice-coded midpoints by their respective
frequencies (Column 6) and compute their mean:

        $\frac{\Sigma f x'}{N} = \frac{-18}{107} = -.17$

This completes the encoding operation and sets the stage for the reverse
course of decoding. Accordingly, (5) we multiply the double-coded mean
by class width and thereby obtain the correction factor:

        $C = 3\left(\frac{-18}{107}\right) = -.50$

We finally (6) add this correction factor to the guessed mean in order to
find the true mean:

        $\bar{X} = 13 + (-.50) = 12.5$

Putting the separate steps together we obtain the complete working
procedure:

        $\bar{X} = X' + i\left(\frac{\Sigma f x'}{N}\right)$              (5.3.3)

Here $i$ denotes the class width. Substituting the quantities calculated
above, we have

        $\bar{X} = 13 + (-.50) = 12.5$
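
As an illustrative aside, the double-coding procedure for grouped data can be sketched in Python. This is a minimal sketch under stated assumptions: the midpoints and frequencies are those read from the suicide-rate table (midpoints 4 through 28, class width 3, N = 107), and the function name `grouped_mean` and its parameter names are hypothetical.

```python
def grouped_mean(midpoints, freqs, guessed_mean, width):
    """Mean of grouped data by the guessed-mean method with double coding."""
    # Code by subtraction, then by division: x' = (X - X') / width.
    steps = [(m - guessed_mean) / width for m in midpoints]
    n = sum(freqs)
    # Weight each twice-coded midpoint by its frequency.
    sum_fx = sum(f * s for f, s in zip(freqs, steps))
    # Decode: multiply by the class width, then add back the guessed mean.
    correction = width * sum_fx / n
    return guessed_mean + correction


midpoints = [4, 7, 10, 13, 16, 19, 22, 25, 28]
freqs = [6, 18, 29, 24, 13, 7, 4, 4, 2]

coded = grouped_mean(midpoints, freqs, guessed_mean=13, width=3)

# The same mean obtained directly, without any coding.
direct = sum(f * m for f, m in zip(freqs, midpoints)) / sum(freqs)

print(round(coded, 1), round(direct, 1))
```

Choosing a different guessed mean (for example 10, the midpoint of the most frequent class) changes the size of the coded values but not the final result.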

Mechanical Routine. The foregoing procedure may at first appear
circuitous and complicated. However, it becomes very simple in actual
operation, since the twice-coded values may be more quickly obtained
without the intervening step presented in Column 4. Instead, we simply
designate as zero the interval of the guessed mean, from which we measure
the interval-deviations, as in Column 5.

This abbreviated routine is illustrated in Table 5.3.6b, which is
essentially a worksheet on which the necessary steps are carried out. It
yields all of the quantities required by the formula.

The manner of calculation demonstrates effectively the economy of
selecting the guessed mean in the general neighborhood of the true mean.



each deviation around the guessed mean by the same amount. The fact
that the correction factor is positive indicates that the guessed mean was
too low.


Table 5.3.5
Computing the Mean, Method of the Guessed Mean, X' = 5

        X        X - X'
        2         -3
        5          0
        7          2
        8          3
       10          5
       --         --
       32          7

    $\bar{X} = 6.4$        $C = \frac{\Sigma (X - X')}{N} = 1.4$

    $\bar{X} = X' + C$
             $= 5 + 1.4$
             $= 6.4$
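
As an illustrative aside, the method of the guessed mean (Formula 5.3.2) can be sketched in Python for the five values 2, 5, 7, 8, 10 with a guessed mean of 5. The function name `guessed_mean_method` is hypothetical; the second guess of 8 is added only to show the sign of the correction when the guess is too high.

```python
def guessed_mean_method(values, guess):
    """Return (mean, correction) where mean = guess + mean of deviations."""
    deviations = [x - guess for x in values]      # X - X'
    correction = sum(deviations) / len(values)    # C = sum(X - X') / N
    return guess + correction, correction


values = [2, 5, 7, 8, 10]

mean_low, c_low = guessed_mean_method(values, 5)    # guess below the true mean
mean_high, c_high = guessed_mean_method(values, 8)  # guess above the true mean

print(round(mean_low, 1), round(c_low, 1))
print(round(mean_high, 1), round(c_high, 1))
```

A positive correction signals a guess that was too low, a negative correction a guess that was too high; in both cases the corrected mean is the same.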


Grouped Data. Of course, the above trivial illustration does not require 
the short-cut technique of the guessed mean. It is grouped data which 
stand in need of such assistance. And here it is expeditious to code not 
only by subtraction, but by division as well. We will demonstrate the 
steps in this procedure (Table 5.3.6a) on the frequency table of 107 suicide


Table 5.3.6a  Computation of Mean, Grouped Data, Suicide Rates, 1950

     (1)       (2)       (3)       (4)          (5)           (6)
     Rate     Mdpt.       f       X - X'    (X - X')/i        fx'
                                               = x'
     3- 5       4         6        -9          -3            -18
     6- 8       7        18        -6          -2            -36
     9-11      10        29        -3          -1            -29
    12-14      13 (X')   24         0           0              0
    15-17      16        13         3           1             13
    18-20      19         7         6           2             14
    21-23      22         4         9           3             12
    24-26      25         4        12           4             16
    27-29      28         2        15           5             10
                        ---                                  ---
                    N = 107                                  -18




It is obvious, however, that in many problems, unequal weights will be
encountered. In such a case, the means of the subgroups, prior to their
being combined, would have to be specifically weighted by their
respective N's.

Thus, if the two housing units, with grade-point averages of 1.4 and
1.8 had populations of 26 and 42 respectively, each mean would have to
be accorded its proper weight in order to obtain the mean of the combined
group. This is done by multiplication, which constitutes the procedure
of weighting (Table 5.3.7). The resemblance to the familiar frequency


Table 5.3.7
Computation of Weighted Mean

        X         f         fX
       1.4       26        36.4
       1.8       42        75.6
                 --       -----
                 68       112.0

    Weighted $\bar{X} = \frac{112.0}{68}$
                      = 1.65 grade points per student


table is obvious. In fact, the summing of the ordinary frequency table 
is a process of weighting each midpoint by its frequency. 

The failure to weight often leads to absurd results. We cite an illustra- 
tion from baseball: 

A batter, in 75 trips to the plate, has amassed 25 hits for a batting average
(mean) of .333; on this day he gets 5 hits in 5 trips, for a day's average of
1.000. What is the combined average?

A naive grandstand "statistician" may add the two averages, and then
divide by 2, for a combined average of .667, a patently unreasonable
result. The proper calculation is shown in Table 5.3.8.


Table 5.3.8
Mean of Combined Sets, Unequal N's

        X         f         fX
       .333      75         25
      1.000       5          5
                 --         --
                 80         30

    Weighted $\bar{X} = \frac{30}{80}$
                      = .375 (batting average)
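
As an illustrative aside, the weighting of means can be sketched in Python. The function name `weighted_mean` is hypothetical; the figures are those of the batting and housing-unit illustrations in the text.

```python
def weighted_mean(means, weights):
    """Combine subgroup means, weighting each by its number of observations."""
    total = sum(m * w for m, w in zip(means, weights))   # sum of fX
    return total / sum(weights)                          # divided by sum of f


# Batting: 25 hits in 75 trips, then 5 hits in 5 trips.
batting_means = [25 / 75, 5 / 5]
batting_trips = [75, 5]

naive = sum(batting_means) / len(batting_means)          # ignores the weights
proper = weighted_mean(batting_means, batting_trips)     # 30 hits / 80 trips

# Housing units: grade-point averages 1.4 and 1.8 for 26 and 42 students.
housing = weighted_mean([1.4, 1.8], [26, 42])

print(round(naive, 3), round(proper, 3), round(housing, 2))
```

The naive figure treats one day's five trips as though they counted for as much as the seventy-five that preceded them; the weighted figure is simply total hits over total trips.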




Table 5.3.6b  Worksheet for Computation of Mean, Grouped Data

     (1)        (3)        (5)        (6)
     Rate        f          x'        fx'
     3- 5        6         -3        -18
     6- 8       18         -2        -36
     9-11       29         -1        -29
    12-14       24          0          0
    15-17       13          1         13
    18-20        7          2         14
    21-23        4          3         12
    24-26        4          4         16
    27-29        2          5         10
               ---                   ---
           N = 107                   -18


However, the principles of arithmetic do not require this selection. Any
other "guessed" value will do the work, but less efficiently. In machine
calculation, however, when arithmetic economy is not such an important
factor, the calculation may be simplified by arbitrarily setting the guessed
mean at zero, and coding from zero, which is equivalent to using the
given raw values.

The Weighting of Means. The weight of a value, statistically speaking,
is simply its frequency. Any collective measure may, therefore, be
considered as having a weight equal to the number of observations it
represents. The mean is such a collective measure; it is not itself an observed
value; it is a derived figure. Consequently, like ratios and percentages,
which are also derived values, the mean should never be manipulated
without taking its weight into consideration.

Two or more means are, themselves, frequently averaged; that is, we
may take the mean of a series of means. If the means derive from equal
N's, i.e., are of equal weight, there is no special difficulty in the operation.
Consider the following problem:

A group of 42 students, living in one housing unit, have a grade-point average
of 1.4; another group of 42 students, in another housing unit, have a
grade-point average of 1.8. What is the average for the 84 students?

Since the grade-point means have equal weight, one need only add the 
two means and divide by 2: 

        $\bar{X} = \frac{1.4 + 1.8}{2}$

           = 1.6 grade points per student



operations, as in the preparation of budget norms used by welfare workers,
or cost of living indexes employed by economists.

Furthermore, as with many other prevalent statistical concepts, the
term possesses fringe meanings which may at first seem to be inconsistent
with the quantitative meaning here assigned. Some of these meanings
are more metaphorical than material, as, for example, the weight assigned
to one person's opinion or the weight of an argument. When more closely
inspected, however, the varieties of connotations will be found to have a
pervasive common core which leaves the utility of the concept unmarred.

Utility of the Mean. Although the mean is the most widely used average,
it is not of universal applicability. Its fundamental characteristic, namely,
that it reflects the magnitude of every item in the array, renders it at the
same time both appropriate and inappropriate for specific purposes. Since
it reflects every value in the array, it will be affected by the extremely
high or low values that are always found in a skewed distribution, and it
will therefore lose its typicality and perhaps mislead the reader. This
phenomenon has led some to assert that the mean should not be used when
the data are markedly skewed.

But such a conclusion confuses the arithmetic and the conceptual aspects
of this statistical calculation. In a statistical sense, it is true, the mean
may not be typical; however, in a substantive sense, the mean value may
be the very one desired. In the case of "picnic finance," for example, it is
intended that the extreme magnitudes (e.g., a person's outlay for expensive
meat) be reflected in the equal division of expenditures. On the other
hand, if a young lawyer were estimating the size of his prospective income,
he would not compute an arithmetic average of the combined incomes of
corporation attorneys and those of the less prosperous but more numerous
solo lawyers. The mode would furnish a better indication of his prospects.
The inappropriateness of the mean for this set of data lies in the
heterogeneity of the lawyer group, in which the young as well as the
experienced are indiscriminately merged.

The fact that the magnitude of every item is accurately weighted endows
the mean with a certain mathematical character which is not shared by
any other average. Hence, the mean may be used to build up subsequent
computations. We have demonstrated that means can, themselves, be
averaged and weighted, can be segmented and recombined, a quality which
is not possessed by the mode and the median. Means can be manipulated
algebraically because they have been obtained algebraically. This
extraordinary utility of the mean confers upon it a prestige that sometimes
undeservedly overshadows the more limited but still quite specific and
indispensable qualities of the other averages.




Unweighted Means. The logical requirement of weighting is not always
as clear cut as it is in the above illustration of the batting averages. There
are occasional instances in which discretion may be employed concerning
whether to weight in the conventional manner as described above, or
whether to disregard the N's from which they have been derived.

Let us suppose that we wish to calculate the grade average of a large
number of housing units. The previously described procedure of weighting
by the number of students in each housing unit would naturally yield the
grade-point average per student. However, if we desire the grade average
per housing unit, we would ignore the size of the population of each housing
unit and compute the average of the house averages. Such a mean is
usually labeled an unweighted average.

In a certain sense, of course, there is no such thing as an unweighted
average; it is only a question of which weight we select, and this is
determined by the nature of the information desired. To put it concisely, it
is a question of the average "per what" we demand. In the following
problem, the choice of the unweighted average will again seem most
reasonable.

Ten of the largest cities in the United States have suicide rates (per 100,000
population) as follows: S, 9, 3, 0, 1(, 6, 10, 2, 1, 7. What is the mean? 

Presumably, we could weight the rate of each city according to its
population size, and compute the combined rate, as we did for the batting
average. However, we are not likely to be interested in the suicide
rate of such a population aggregate, since this aggregate of persons in the
ten individual cities does not constitute a meaningful social unit. Rather,
we are more likely to be interested in the average rate per city in this
category; hence, each city would be treated as a unit, and not weighted for
population size. This average is an unweighted mean of the rates for the
individual cities:

        $\bar{X} = \frac{\Sigma X}{N} = 5.8$, the mean rate per city

There can, of course, be no routine, ironclad rule on the issue of weighting.
It is a question of purposes and consequences. Consider, for example, the
consequences of weighting the above list of cities by population, of which
New York City is one. That city would then overwhelm the rates of all
other cities and produce a composite rate that would differ very little from
that of the great metropolis. For this reason, as well as the even more
pertinent one that the unit of interest is the individual city and not the
person, the unweighted average is called for.
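
As an illustrative aside, the question of the average "per what" can be sketched in Python. The data are partly hypothetical: the first two housing units carry the grade-point averages and sizes used earlier in the text, while the third unit (average 2.6, 12 students) is invented solely to make the contrast visible.

```python
# Grade-point average and number of students for each housing unit.
# The third unit is a hypothetical addition, not taken from the text.
unit_means = [1.4, 1.8, 2.6]
unit_sizes = [26, 42, 12]

# Average per housing unit: every unit counts once, whatever its size.
per_unit = sum(unit_means) / len(unit_means)

# Average per student: every unit counts in proportion to its population.
per_student = (
    sum(m * n for m, n in zip(unit_means, unit_sizes)) / sum(unit_sizes)
)

print(round(per_unit, 2), round(per_student, 2))
```

Neither figure is the correct one in the abstract. The first answers a question about housing units, the second a question about students, and the small third unit pulls the per-unit average upward far more than it pulls the per-student average.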

Although the concept of weighting is here applied only to means, rates,
and percentages, it is widely applicable to many complicated combining



§ 5.4 Criteria for Choice of Average


12. In Table 5.3.9, if the letter grades are scored as follows: A (3), B (2), C (1),
what is the student's grade point per course? per hour?

13. Compute the mean of the frequency distribution of delinquency rates
(Problem 11, p. 47), according to Formula 5.3.3.


Section Four 

Criteria for Choice of Average 

Because we are frequently faced with alternative choices among the
various averages, it is necessary to lay down a few principles which will
serve as guides in the selection of a central value. To a certain extent,
this problem has already been clarified. It has been stated that there is
no all-purpose average which can be universally employed. It should
always be kept in mind that an average is a single representative value
which possesses the convenience of compactness, but also the inconvenience
of its brevity. Even at best, it will conceal and exclude as much or more
information than it reveals of the distribution from which it is extracted.

Three major criteria, in order of priority, which may be used in judging
the applicability of an average are: (1) the purpose to be served, or the
question that it is designed to answer; (2) the pattern of the distribution
of the data; (3) various technical considerations, primarily of an
arithmetic nature, which limit the choice of average.

Purpose To Be Served. Any empirical sociological inquiry is essentially
an attempt to answer a question of a substantive nature, for which the
statistical procedure is only a tool. Questions to which one or another
average would supply the answer could be: What is the size of the American
family? What is the length of life? What is the age of the American
population? Such questions must be further interpreted in terms of the
purposes of the inquiry. If the average size of the American family is
desired for purposes of planning a housing development, the crude mode
would be more pertinent than the mean or median, even though the precise
degree of modality is unknown. Houses are not built for non-existent
arithmetic averages, but for actual families that do exist with a certain
frequency. If, on the other hand, the average is to serve as a measure of
general fertility, the mean would be more relevant, since it would reflect
the contribution of the most fertile as well as the least fertile. For actuarial
purposes, the equal distribution of risks by very definition requires the
mean length of life.

Pattern of Distribution. Distributions may, of course, range in degree from
symmetrical to extremely skewed. Symmetry means that the values are



Questions and Problems

1. Define the following concepts: 

Arithmetic Mean 
Coding 
Weighting 
Weighted Mean 
Guessed Mean 

2. Explain why the sum of the deviations around the mean is zero.

3. In computing the mean of grouped data, what value is assigned to all items
in a given class interval? Is this a reasonable procedure? Explain.

4. Give several instances when the arithmetic mean would be appropriate, even
though the scatter of items is highly skewed.

5. When zero is used as the guessed mean in computing the mean, what is the
value of C, or the correction factor?

6. Discuss the validity of the statement that the extreme values in a skewed
distribution make a disproportionate contribution to the arithmetic mean.

7. Outline the procedure for finding the mean of two or more groups when only
the total frequency and the arithmetic mean of each group is given. Express
this procedure symbolically.

8. If the mean of 100 items is 10, and the mean of 50 items is 15, what is the
mean of the combined group? What would the mean be if each group had
50 items? 100 items?

9. A group of 25 families has a mean weekly income of $45.00; a second group
of 15 families has a mean weekly income of $60.00. What is the total income
of the combined group?

10. If the mean of a series is 6 and the guessed mean is 5, by how much on the
average is each deviation from the guessed mean too large?

11. A student received letter and percentage grades in his courses as shown in
Table 5.3.9. What is his percentage grade per course? per hour? Which average
would be used? Would you weight one or both averages? Explain.

Table 5.3.9  Student's Grade Record

                                               Grade
    Subject                Credit Hours   Percentage   Letter
    History                     5             92          A
    English Literature          3             85          B
    Psychology lab              2             72          C
                                5             95          A




however, additional computations are anticipated, the mean and its 
derivatives must be selected. 

Minimum, Maximum, and Intermediate Measures. In many statistical
discussions, only the averages seem to be considered as possible measures of
location. However, it should be re-emphasized that this is a limited
conception. Measures of location are not necessarily measures of typicality.
Averages tend to connote typicality or representativeness; but the maximum,
minimum or any intermediate value may serve to fix location, provided
it is selected from the whole array. For example, although many a
prospective university student (or his parent) may find the average
expenditure on the campus a useful guide, an impecunious student may be
interested only in the minimum, and an affluent one only in the maximum,
as descriptive of the standard of living on that campus. Consequently,
it should be reiterated that an average must be viewed as only one of
the many measures of location, each appropriate to a specific problem.

Characteristics of Averages. The selection of the proper average presents
problems not posed by other measures of location. Hence, a thorough
knowledge of the descriptive characteristics of each average is a prerequisite
to their proper employment. The characteristics of the different averages
are frequently discussed in terms of their "advantages and disadvantages."
But these terms are evaluative, rather than descriptive, and therefore have
no fixed meaning. An advantage in one context may be a disadvantage
in another. Hence, we prefer to set forth the characteristics which are
intrinsic in the averages, quite independent of the setting in which they
may be used.


Summary of Characteristics of Averages

The Mode

1. It is the most frequent value in the distribution; it is the point of
greatest density.

2. The value of the mode is established by the predominant frequency,
and not by the values in the distribution.

3. It is the most probable value, and hence the most typical.

4. A given distribution may have two or more modes. On the other
hand, there is no mode in a rectangular distribution.

5. The mode does not reflect the degree of modality.

6. It cannot be manipulated algebraically: modes of subgroups cannot
be combined.

7. It is unstable in that it is influenced by grouping procedures.

8. Values must be ordered and grouped for its computation.

9. It can be calculated when table ends are open.

10. It is the only average which can be applied to qualitative variables.



distributed identically on either side of the mean; while skew is merely
the absence of symmetry. The degree of skewness affects the typicality
and representativeness of the average values, and hence must be taken
into account in judging their relevance.

It is sometimes alleged that, if the frequency curve is symmetrical, there
is no problem of choice, since mean, median, and mode are identical. This
is true, however, only in an arithmetic sense, not in a conceptual sense.
Even with the same numerical value, there is a different imagery attached
to the concepts of the mean, mode, and median. For example, the "average"
student does not conceive of his grade as being equal to the sum of
all grades divided by N, but rather conceives of it as a position in the array.
The young physician, new to the community, in estimating his probable
income, does not concern himself with the mean, but with the modal
income, even though quantitatively they may be identical. From these
illustrations we may conclude that, even though the distribution is
symmetrical, we still select the average which conceptually satisfies our
purposes.

As the distribution becomes more and more skewed, the values of the
averages diverge correspondingly, and the choice of the average becomes
more critical. In such cases, the mean loses its typicality, in that it is less
likely to be empirically encountered. In the U-curve, in fact, the mean
may be practically nonexistent, and therefore may be quite unrealistic.
Since many sociological data (wages, sizes of cities, sizes of families, and
the like) are often severely skewed, it is essential to consider the pattern
of the curve in the selection of the most suitable average.

Technical Considerations. There are certain purely technical features of a
tabulation which may compel the use of one or another average. Thus,
since the mean cannot be calculated for open-ended data, the median may
have to be resorted to. But this is often quite satisfactory provided the
distribution is not unduly skewed, in which event mean and median do
not differ much in any case.

On the other hand, the mean is the only average computable when only
the total values and N are known, even though another average might
have been preferred. This is illustrated by a citation from the U.S. Census,
1950:

        $\bar{X} = \frac{\text{Total Population in Families}}{\text{Number of Families}}$

                 $= \frac{138,079,600}{38,310,980}$

                 = 3.6 persons per family

The technical requirements of possible subsequent calculations should
always be kept in mind. Since there is no method for combining or
weighting medians and modes, they tend to become terminal measures. When,



Being influenced by each individual value, the mean of a skewed distribution
will be pulled in the direction of the extreme values. The mode, of
course, is not influenced by the tail of the distribution; and the median
is drawn toward the tail solely by the relative frequency of items in that
tail. However, this latter attraction is not very great because the
concentration of items in the tail is necessarily sparse. Accordingly, when the
skew of a unimodal distribution is to the right (Figure 5.4.1), the order of


Figure 5.4.1  Mean, Median, and Mode, Right Skew

the averages on the base line will be: mode, median, and mean; and the
gap between them will vary according to the severity of the skew. A skew
to the left will, of course, reverse the order.
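
As an illustrative aside, the ordering of the three averages under skew can be checked in Python with the standard `statistics` module. The ten values below are hypothetical, invented only to give a unimodal series with a long right tail; mirroring them yields the left-skewed case.

```python
import statistics


def three_averages(values):
    """Return (mode, median, mean) of a series of numbers."""
    return (
        statistics.mode(values),
        statistics.median(values),
        statistics.mean(values),
    )


# Hypothetical right-skewed series: most values are small, one is extreme.
right_skewed = [2, 3, 3, 3, 4, 4, 5, 6, 9, 21]
mode_r, median_r, mean_r = three_averages(right_skewed)

# Mirror image of the same series: the long tail now lies to the left.
left_skewed = [-v for v in right_skewed]
mode_l, median_l, mean_l = three_averages(left_skewed)

print(mode_r, median_r, mean_r)
print(mode_l, median_l, mean_l)
```

In the right-skewed series the single extreme value of 21 draws the mean well past the median, while the mode is untouched by it.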

But even two averages of the same type cannot be safely compared
unless the patterns of distribution are similar. It is not too reckless to
proffer the maxim that two or more averages should never be compared
unless "all other things are equal" or at least similar. The comparison of
the mean height of males (X) and that of females (Y) (Figure 5.4.2) is



Figure 5.4.2  Mean Heights of Males (X) and Females (Y)



The Median

1. It is the value of the middle point of the array (not midpoint of
range), such that half the items are above and half below it.

2. The value of the median is fixed by its position in the array, and
does not reflect the individual values.

3. The aggregate distance between the median point and all the values
in the array is less than from any other point.

4. Each array has one and only one median.

5. It cannot be manipulated algebraically: medians of subgroups cannot
be weighted and combined.

6. It is stable in that grouping procedures do not appreciably affect it.

7. Values must be ordered, and may be grouped, for computation.

8. It can be computed when ends are open.

9. It is not applicable to qualitative data.

The Mean

1. It is the value in a given aggregate which would obtain if all the
values were equal.

2. The sums of the deviations on either side of the mean are equal;
hence, the algebraic sum of the deviations is equal to zero.

3. It reflects the magnitude of every value.

4. An array has one and only one mean.

5. Means may be manipulated algebraically; means of subgroups may
be combined when properly weighted.

6. It may be calculated even when individual values are unknown,
provided the sum of the values and N are known.

7. Values need not be ordered or grouped for its calculation.

8. It cannot be calculated from a frequency table when ends are open.

9. It is stable in that grouping procedures do not seriously affect it.

10. It is applicable only to quantitative data.
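
As an illustrative aside, two of the characteristics listed above can be verified numerically in Python: that the algebraic sum of deviations around the mean is zero, and that the aggregate (absolute) distance from the median is smaller than from any other point. The helper names and the grid of trial points are hypothetical choices made for this sketch.

```python
def sum_of_deviations(values, point):
    """Algebraic sum of the deviations of the values from a point."""
    return sum(v - point for v in values)


def aggregate_distance(values, point):
    """Sum of the absolute distances of the values from a point."""
    return sum(abs(v - point) for v in values)


values = [2, 5, 7, 8, 10]
mean = sum(values) / len(values)
median = sorted(values)[len(values) // 2]   # middle item of an odd-sized array

# Deviations above and below the mean cancel (up to rounding error).
deviation_sum = sum_of_deviations(values, mean)

# Compare the median with a grid of other trial points from 0 to 12.
trial_points = [p / 2 for p in range(0, 25)]
best_point = min(trial_points, key=lambda p: aggregate_distance(values, p))

print(mean, median, best_point)
```

The mean balances the signed deviations, whereas the median minimizes the unsigned ones; the two properties belong to different averages and should not be confused.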

Comparability of Averages. Like every other statistical quantity, averages
are used for comparative purposes. Specifically, averages are used to
compare the locations of distinct groups or distributions on the same scale.
It should be obvious, however, that the different types of averages are not
comparable. That is to say, the mean of one distribution cannot be
legitimately compared to the mode of another in determining their relative
locations on a given continuum. The reason for this prohibition is simply
that the various types of averages are not coordinate measures; they
reflect entirely different aspects of a distribution; the mode reflects the
highest frequency, the median reflects the middle position, and the mean
reflects the centrality of values.

These now familiar differentiating characteristics become all the more
apparent when the effect of skewness on the averages is taken into account.



§ 5.4 Criteria for Choice of Average 

approximately the same range; the means are separated, however, because 
of the skew in one of the distributions. Here, too, the modes, which are 
almost identical, would do greater justice to the location of the data. Of 
course, when dominance of location is less pertinent than centrality of 
values, our comparison would necessarily be based on means. 

Conclusion. The foregoing discussion inevitably leads to the truism which 
could have been anticipated: averages, like every other statistic, are 
incomplete descriptions of a set of data. They should not be conceived of 
as autonomous, but rather as being bound to the data from which they are 
derived. A responsible use of the average (or, for that matter, any other 
measure of location) must take into consideration the tendency of the 
user to reconstruct mentally the distribution from which only a single 
value has been extracted. To supplement this incomplete description 
afforded by the average, it is essential to examine the scatter or dispersion 
of the distribution relative to the average. This is the subject of the next 
chapter. 


Questions and Problems 

1. Define the following concepts: 

Symmetry 

Skew 

Right (Positive) Skew 
Left (Negative) Skew 
Unimodal 

Rectangular Distribution 

Maximum 

Minimum 

2. State in general terms how right skew affects the three common averages; 
left skew. 

3. In a frequency distribution of family incomes in the United States (Table 5.2.4), 
how would the median, mode, and mean rank by order of magnitude? 

4. As a result of the mental tests administered to American soldiers during World 
War I, it was reported that "most Americans were below the average score." 
Is that possible? Probable? Which average could have been meant? 

5. When would the maximum value best represent a frequency distribution? 
The minimum? 

6. How would the median and mean grades in a certain class be affected if: 

(a) the poorest students in the class withdrew? 

(b) the poorest students were successfully tutored? 

(c) the middle students were successfully tutored? 

(d) an easier examination were given? 





Averages 


quite appropriate. Men are taller than women, and the degree of difference 
is properly measured by the difference in their means. Males and females 
are located merely on different segments of the same scale of heights; their 
patterns of variation are identical. 

However, when the pattern of distribution differs markedly, such 
comparisons may in fact be very misleading, especially in the case of means. 
For example, when two means are quoted as identical, the reader may be 
misled into reconstructing the curves imaginatively also as identical 
patterns. In truth, however, the two series may present quite different 
contours, as in Figure 5.4.3. 



Figure 5.4.3 Opposite Skews, Identical Means 

To be sure, in a gross sense, the locations of the two sets of data are 
identical, for they cover exactly the same range. But the locations of the 
predominant segments of the distributions are widely separated, and would 
be more accurately specified by the modes. In this instance, the identical 
means are a consequence of the overlapping tails, rather than a reflection 
of the overlapping humps. 

In Figure 5.4.4, the locations of the predominant frequencies are within 



Figure 5.4.4 Effect of Skew on the Mean 





Variation 


Section One 

Measures of Spread 


Concept of Variation.* Variation is the occasion for all statistics. If all 
the values in a given set were identical, it would be superfluous to 
calculate an average (or, for that matter, any other statistical measure), 
since any single measure would already accurately represent all. The 
very purpose of averaging is of course to provide a single value to represent 
a group of unlike values. In fact, averages were invented to suppress the 
differences among values whenever such differences are not pertinent to 
the case. 

However, under certain circumstances these differences may be of as 
much or even more interest than the average itself. Thus, in arriving at a 
final grade, a teacher will consider not only the average of the student's 
test scores, but the spread of those scores as well. A student with marks of 
100, 90, and 50 will be differently evaluated from one with scores of 80, 
83, and 77, although both present the same mean score of 80. In selecting 
for varsity competition between two players with equal scoring averages, 
the basketball coach is more likely to use the consistent player who is 
seldom off that average, rather than the erratic performer who is generally 
low but now and again spectacularly high. Analogously, two occupational 
groups having approximately the same mean annual income (professors 
and business executives, for example) may nevertheless present very 
different patterns of income opportunities. In the teaching profession, 
* Many textbooks do not distinguish between the terms variability and 
variation. We restrict ourselves to the concept variation, defining 
variability as the capacity to vary, and variation as the manifestation of 
that capacity which we endeavor to describe and measure. Without variation, 
there could be no statistics in the first place. For that reason, statistics 
is now and then referred to as the "science of variation." 




7. In what type of distribution would the median student not be included among 
the modal students? Show graphically. 

8. Does the expression, "the average Englishman" have any statistical meaning? 
Can the concept be quantified? Similarly: the average married man, average 
eye-color, average Catholic, average American, average taxpayer. 

9. Identify the averages in the following hypothetical illustration: The average 
American father in 1953 was aged 44, had 1.5 children, lived in a town of 
3,500, was a native American, spent $1,200 in retail stores. 

10. Distinguish between the two statements: The average American male marries 
at age 26; the American male marries at the average age of 26. 

11. List the absolute minimum information necessary for the calculation of the 
mean, mode, and median from grouped data. 

12. In Community A, the modal length of life is 55, the median is 60, and the mean 
is 65; in Community B, the modal length of life is 70, the median is 65, and 
the mean is 60. From this information, reconstruct the frequency curves. 
Which community is the healthier? 







and below the mean are asymmetrical to each other; whether the values 
become more numerous as we approach the mean (unimodal), or more 
numerous as we approach the extremes of the range (U-shaped); whether 
the values distribute themselves uniformly over the entire range with no 
point of concentration (rectangular), or congregate at rival points along the 
range as in a bimodal distribution. Also, we may judge from the tail of the 
frequency curve whether the skew is to the right or to the left, as well as 
the severity of the skew. Finally, by superimposing one curve on another, 
it is possible to rank distributions according to their degree of kurtosis, a 
term used to describe the extent to which a unimodal curve is peaked. 

However, such impressionistic judgments of variation have a restricted 
utility. In fact they may be quite misleading, affected as they are by the 
arbitrary graphic scales. Moreover, such visual impressions are personal 
and subjective and therefore can never be precisely communicated. They 
therefore serve to point up the need for objective indexes of variation which 
have a standard meaning. 

Measurements of Range. The crudest and simplest measure of variation 
is the total range, or merely the range. By definition, the range is that 
interval which encompasses all of the values. Consequently, it is calculated 
in exactly the same manner as a class interval; we take the difference 
between the true extremes of the array, which in this case constitute the 
interval boundaries. Thus, to find the range of the suicide rates (Table 
3.1.1c), we identify the extreme true values and then subtract one from the 
other. The smallest rounded rate is 3 and the largest is 29; therefore, the 
range is the difference between 2.5 and 29.5, or 27. 

This is the minimum span required to accommodate all of the observed 
rates. Had a larger or smaller number of rates been recorded and included 
in the set, the range would undoubtedly have been otherwise. For example, 
the range of rates of cities with 50,000 inhabitants and over would in all 
likelihood have exceeded 27, while the range for the less numerous cities 
of 250,000 and over would probably have been smaller. By augmenting 
the original group of observations, the range can either expand or remain 
unaltered; it cannot shrink. For this reason, two or more ranges should 
in general not be compared unless they are based on approximately the 
same number of items. For example, it would be inappropriate to compare 
the score range of two students, unless each range embraced about the same 
number of test scores. 

For discrete data, the procedure is the same as that for continuous data, 
except that the true limits are now a necessary fiction. Thus, an array of 
family sizes of 2 through 12 has a range of 11, which is the difference 
between 12.5 and 1.5. This means that the variable can take 11 and only 11 
consecutive values: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12. Some texts label this 
the inclusive range and define it as (H − L) + 1; e.g., (12 − 2) + 1. 
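
The two ways of computing the range described above can be put side by side. The following is a minimal sketch, not part of the original text; the function names `total_range` and `inclusive_range` are our own, and the half-unit adjustment assumes values rounded to the nearest whole unit.

```python
def total_range(values, unit=1.0):
    """Range between the true limits of the extreme rounded values."""
    upper_true_limit = max(values) + unit / 2
    lower_true_limit = min(values) - unit / 2
    return upper_true_limit - lower_true_limit


def inclusive_range(values):
    """(H - L) + 1 for whole-number data."""
    return (max(values) - min(values)) + 1


# Only the extremes matter; the rates in between are omitted here.
suicide_rate_extremes = [3, 29]
family_sizes = list(range(2, 13))     # 2 through 12

print(total_range(suicide_rate_extremes))   # 29.5 - 2.5
print(total_range(family_sizes))            # 12.5 - 1.5
print(inclusive_range(family_sizes))        # (12 - 2) + 1
```

For whole-number data the two definitions always agree, which is why the true limits can be treated as a convenient fiction.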



salaries are more standardized around the average with small likelihood of 
extreme poverty or wealth, whereas, in business, incomes are more widely 
dispersed, with both greater risk of poverty and occasional opportunity for 
wealth. 

In general, therefore, the pattern of variation among the observed 
values (what statisticians call spread, scatter, or dispersion) is of as 
much relevance as is the location of the distribution. Various devices are 
available for the measurement of variation and are usually included in 
every well-provided kit of statistical tools. Although these devices differ 
in detail, all fall into one of three broad categories corresponding to the 
procedure on which they rest: (1) measurements of range that include all 
or a specific percentage of the items; (2) measurements based on deviations 
of values from a selected central value; and (3) measurements of 
heterogeneity in qualitative variables. 

Some preliminary visual notion of dispersion in quantitative data may 
be obtained from graphic sketches, such as those presented in Figure 6.1.1. 
Thus, from the frequency graph we may discern whether the values are 
symmetrically dispersed around the mean, or whether the values above 


[Figure 6.1.1: sketches of frequency curves; panel labels include 
Symmetrical, Asymmetrical, and Unimodal] 





Table 6.1.1  Computation of Selected Quantiles, Annual Family Income, 
             U.S., 1956 

Total Money Income      Per Cent     Cumulated Per Cent 
Under $500                 3.2%             3.2 
$500-$999                  3.3              6.5 
$1,000-$1,499              4.4             10.9 
$1,500-$1,999              4.5             15.4 
$2,000-$2,499              5.1             20.5 
$2,500-$2,999              5.1             25.6 
$3,000-$3,499              6.2             31.8 
$3,500-$3,999              6.3             38.1 
$4,000-$4,499              8.0             46.1 
$4,500-$4,999              6.9             53.0 
$5,000-$5,999             13.7             66.7 
$6,000-$6,999              9.8             76.5 
$7,000-$9,999             15.6             92.1 
$10,000-$14,999            5.9             98.0 
$15,000-$24,999            1.5             99.5 
$25,000 and over           0.5            100.0 
Total                    100.0% 

N = 43,443,000 

Computations: 

\[ Md = 4{,}500 + \left( \frac{3.9}{6.9} \times 500 \right) = \$4{,}783 \]

\[ Q_1 = 2{,}500 + \left( \frac{4.5}{5.1} \times 500 \right) = \$2{,}941 \]

\[ Q_3 = 6{,}000 + \left( \frac{8.3}{9.8} \times 1{,}000 \right) = \$6{,}847 \]

\[ C_{10} = 1{,}000 + \left( \frac{3.5}{4.4} \times 500 \right) = \$1{,}398 \]

\[ C_{90} = 7{,}000 + \left( \frac{13.5}{15.6} \times 3{,}000 \right) = \$9{,}596 \]

Source: U.S. Bureau of the Census, Current Population Reports, Series P-60, Consumer 
Income, No. 36, Table 6, U.S. Government Printing Office, Washington, D.C., 1957. 


which indicates that the middle 50 per cent of American family incomes 
are located on an interval slightly less than $4,000 in amount. 
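
The quantiles of Table 6.1.1 are obtained by linear interpolation within the class that contains the desired cumulated percentage. The following is a minimal sketch of that procedure, not part of the original text. The function name `centile` and the data layout are our own; the lower limit of 0 for the first class is an assumption, and the 6.2 per cent for the $3,000-$3,499 class is inferred from the cumulated column.

```python
# (lower limit, class width, per cent of families), after Table 6.1.1.
CLASSES = [
    (0, 500, 3.2),          # "Under $500"; lower limit of 0 is assumed
    (500, 500, 3.3),
    (1000, 500, 4.4),
    (1500, 500, 4.5),
    (2000, 500, 5.1),
    (2500, 500, 5.1),
    (3000, 500, 6.2),       # inferred from the cumulated per cents
    (3500, 500, 6.3),
    (4000, 500, 8.0),
    (4500, 500, 6.9),
    (5000, 1000, 13.7),
    (6000, 1000, 9.8),
    (7000, 3000, 15.6),
    (10000, 5000, 5.9),
    (15000, 10000, 1.5),
    (25000, None, 0.5),     # open-ended class: no width to interpolate in
]


def centile(p, classes=CLASSES):
    """Value below which p per cent of the cases fall, by interpolation."""
    cumulated = 0.0
    for lower, width, pct in classes:
        if cumulated + pct >= p:
            if width is None:
                raise ValueError("centile falls in an open-ended class")
            return lower + (p - cumulated) / pct * width
        cumulated += pct
    raise ValueError("p must not exceed 100")


median_income = centile(50)
q1, q3 = centile(25), centile(75)
c10, c90 = centile(10), centile(90)

print(round(median_income), round(q1), round(q3), round(c10), round(c90))
print(round(q3 - q1), round(c90 - c10))   # interquartile and 10-90 ranges
```

Because the top class is open-ended, the same routine cannot yield the total range, which is consistent with the fact that quantile-based ranges can be computed from a table whose ends are open.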

The interquartile range may be graphically displayed by plotting the 
quartiles on the base line of the graph (Figure 6.1.2), which reveals the 
extent to which the middle segment of the items is bunched around 
the median. 

This graph also demonstrates that class intervals constructed so as to 
contain equal frequencies will generally not be identical in width. In this 
instance, the four intervals created by the quartiles vary enormously in 
width, although each of the four intervals contains exactly 25 per cent of 
the total frequency. Such a classification of items reverses the procedure 
of the orthodox frequency table, in which intervals are made equal and 
frequencies are left to vary. Here, we set the class frequencies equal and 
permit the class widths to vary. Both orderings are conveniently shown 
on the same graph. 

A little reflection will indicate that there is no limit to the construction 




Simple in conception and calculation, the range, like every other statistic, 
provides only limited information. Because its significance is relative to 
its location on the scale, it will generally be more serviceable when quoted 
in conjunction with its boundary points. Everyday usage recognizes the 
soundness of this principle in such expressions as "Prices on new cars will 
range from $4,000 to $5,000," or "Tomorrow's temperatures will range from 
a low of 42° to a high of 78°." A salary schedule that extends from $5,000 
to $10,000 has the same absolute range as one that extends from $20,000 
to $25,000, but they have very different connotations. In choosing a 
vacation site, it is not enough to know that the range in temperature is 
30°; it is equally necessary to know the scale location of the extreme 
temperatures. 

The range has the further characteristic that it disregards the pattern 
of variation between the extremes, and yet at times this pattern may be 
of greater import. The range of annual family incomes in the United 
States, which is greatly in excess of one million dollars, gives no clues to 
whether the incomes are compactly bunched in the middle, concentrated 
at one end, or uniformly spread over the entire scale. 

Furthermore, in most observed distributions, the extreme values are 
infrequent, erratic, and unstable; hence, the over-all range, which rests 
exclusively on these extremes, may leave the impression of a greater volume 
of variation than actually exists. By basing the age range of college 
students on the singular 14-year-old prodigy and the 64-year-old mother 
who wishes to attend school with her grandchildren, we obtain a range of 
50 years. But this result obscures the fact that most students differ from 
one another by only a few years, and the total range is therefore misleading 
as an index of variation. 

Intermediate Ranges. This dependency on the almost unique extreme 
observations may be overcome by computing an intermediate range which 
excludes a minor fraction of the cases at either end, but which still includes 
a significant portion. By basing it on the less exceptional items, the range 
acquires greater stability and dependability. One common practice is to 
take the difference between the 90th and 10th centiles, and thereby establish 
a range that includes the middle 80 per cent of the cases. Applying this 
procedure to the distribution of American family incomes in 1956 (Table 
6.1.1), we obtain the 10-90 range: 

\[ C_{90} - C_{10} = \$9{,}596 - \$1{,}398 = \$8{,}198 \]

A still more restricted range is the span between the first and third 
quartiles, or the interval that subtends the middle 50 per cent of the items. 
It is naturally called the interquartile range. The interquartile range of 
family incomes is: 

\[ Q_3 - Q_1 = \$6{,}847 - \$2{,}941 = \$3{,}906 \]



2. When are two or more ranges comparable? 

3. How would you calculate the range when the ends of the frequency table are 
open? 

4. Explain why the quartiles and the median practically never divide the total 
range into four equal intervals. In what type of curve will they partition the 
range equally? 

5. The scores on a true-false sociology test of 100 questions ranged from 50 through 
90. How would the range be affected if: 

(a) a bonus of 10 had been given to each student? 

(b) the poorest students had been successfully tutored? 

(c) an easier examination had been used? a more difficult exam? 

(d) 200 items had been used instead of 100? 

6. Calculate the 9 decile points of the percentage distribution of family incomes 
(Table 6.1.1). 


Section Two 

Variation as Measured by Arithmetic Deviations 

It has been shown that the range, or any segment of it, will supply an 
impression of the span of a distribution. Although such a measure has 
some utility, especially when associated with its location, it represents 
merely the limits of variation, rather than the aggregate variation within 
those limits. It does not reflect the variation of the individual items, but 
merely the variation of the observed extremes, ignoring the many 
intermediate values. It may therefore be reiterated that the range measures 
the boundaries of scatter, but not the total amount of variation of the 
collectivity. Hence, some measure must be devised which will reflect the 
extent of diversity among all the items. 





[Figure 6.1.2. Frequency, Annual Family Income, U.S., 1956. The 
interquartile range is marked on the graph.] 

of intermediate ranges, such as the 10-90 or the interquartile range. Any 
intermediate range will bring into clearer focus the relative degree of 
concentration or scatter among the items, particularly when viewed against 
the total range. The fact that the range of American family incomes is 
considerably greater than one million dollars, but that the interquartile 
range is only $3,906, and the 10-90 range is but $8,198, is suggestive of 
the essentially high degree of homogeneity among family incomes in the 
United States. Frequently, the use of strategically placed quantiles (the 
general term for partition values) will be an altogether satisfactory approach 
to the analysis of variation, and more complicated methods need not be 
pursued. 


Questions and Problems 

1. Define the following concepts: 

Variability 
Variation 
Total Range (Range) 
Intermediate Range 
Interquartile Range 
10-90 Range 
Dispersion 
Scatter 
Spread 
Kurtosis 




more importantly, the sum of the arithmetic deviations around the median 
is less than from any other point of origin. Expressed in another way, the 
median is that point around which the arithmetic "errors" are least. This 
may be termed the principle of minimal deviation. The organization of an 
array around the median will therefore give us the most economical and 
tightly knit arrangement of items. Hence, the median would appear to be 
the more logical origin whenever the measure of variation is based on the 
absolute, or arithmetic, deviations. 


Construction of a Deviational Measure: Average Deviation (AD). The mere 
sum of the arithmetic deviations is useless as an index of variation, since 
it will vary with the number of items in the distribution. For example, it 
will generally be large when there are 1,000 items, small when there are only 
10 items. To eliminate this adventitious factor, we divide by N and thereby 
measure the deviation per case. This result is termed the mean deviation 
or average deviation (AD). 

\[ AD_{Md} = \frac{\Sigma |d|}{N} \tag{6.2.1} \]

where |d| = deviation from median, sign ignored. Applying this formula 
to Table 6.2.1, we get: 

\[ AD_{Md} = \frac{22}{5} = 4.4 \]

The average deviation may also be based on the mean, in which case the 
above formula would read: 

\[ AD_{\bar{X}} = \frac{\Sigma |x|}{N} \]

where |x| = a deviation from the mean, sign ignored. Thus, 

\[ AD_{\bar{X}} = \frac{24}{5} = 4.8 \]

which is larger than that based on the median. Hence, whenever skew is 
present, the median always serves best as the point of origin to express 
the amount of variation in the distribution. 
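
The two average deviations, and the principle of minimal deviation on which the preference for the median rests, may be verified on the hypothetical values of Table 6.2.1. The following is a minimal sketch, not part of the original text; the function name `average_deviation` is our own, and the check over trial origins is only a spot check on whole numbers, not a proof.

```python
from statistics import mean, median


def average_deviation(values, origin):
    """Mean of the arithmetic (sign-ignored) deviations from an origin."""
    return sum(abs(x - origin) for x in values) / len(values)


values = [6, 8, 10, 15, 21]                 # Table 6.2.1, hypothetical data

ad_from_median = average_deviation(values, median(values))   # 22 / 5
ad_from_mean = average_deviation(values, mean(values))       # 24 / 5

# Spot check: no whole-number origin between the extremes gives a
# smaller sum of arithmetic deviations than the median does.
sums_by_origin = {
    origin: sum(abs(x - origin) for x in values)
    for origin in range(min(values), max(values) + 1)
}
best_origin = min(sums_by_origin, key=sums_by_origin.get)

print(ad_from_median, ad_from_mean, best_origin)
```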


Coefficient of Relative Variation (CRV).* It is now clear that any deviation 
takes on significance only when compared to its own origin, or norm. A 
variation of $2 in relation to a base of $10 conveys a different meaning 

* Sometimes written CV. Broadly defined, a coefficient is a measure of relationship 
between two variables, expressed as a ratio, proportion, or percentage. 




It is also quite conceivable to think of death rates, student's grades, or 
teacher salaries in relation to a maximum or minimum, especially when 
they are in close proximity to these observed extremes. A teacher will 
perceive his salary as near the ceiling of his professional category. 
Nevertheless, it is more usual to employ a central value as a point of 
reference, because most values are actually in proximity to that central 
point. In other words, we tend to organize our observations around an 
average as a norm, on the assumption that this average is a representative 
value and therefore worthy of being used as a base of comparison. The only 
statistical issues still remaining are: (1) from what average to compute the 
deviations, and (2) how to summarize these deviations in a compact index. 

The Choice of a Norm. Obviously, with symmetrical distributions that are 
unimodal, it makes little difference which average is used, since mode, 
median, and mean are equal. But concrete data often fall short of even 
approximate symmetry around a single peak. Hence, the problem is one 
of selecting a representative value in those distributions where the mode, 
median, and mean diverge one from another. 

Unquestionably, many an observer informally employs the mode, the 
most frequent of his observations, as a base of comparison. But on 
theoretical grounds both the mean and the median can present somewhat 
stronger claims to the title of representativeness, and for that reason they 
are extensively employed in statistical calculations of this type. 

As we have learned, the mean is the value from which the deviational 
variation on the two sides are in balance (Table 6.2.1); hence, the mean 
would appear to be the most reasonable point of origin of the deviations. 
However, the median's claim is at least equally impressive. The median 
is the position which divides the array into two equal parts; hence, and 


Table 6.2.1  Algebraic and Arithmetic Deviations from Mean and Median, 
             where X̄ = 12 and Md = 10 

 Value     Algebraic Deviation       Arithmetic Deviation 
   X        X − X̄      X − Md        |X − X̄|     |X − Md| 
   6         −6         −4              6           4 
   8         −4         −2              4           2 
  10         −2          0              2           0 
  15          3          5              3           5 
  21          9         11              9          11 
  60          0         10             24          22 

Source: Hypothetical 




\[ CRV = \frac{AD_{Md}}{Md} \times 100 = 40\% \]

\[ CRV = \frac{3.04}{Md} \times 100 = 36\% \]

The results demonstrate that the two groups of states are approximately 
equal in relative variation; in fact, the South Atlantic group now appears 
to be even slightly more homogeneous than the New England states. This 
conclusion is independent of N and of scale location, both of which were 
neutralized by this operation. 

The coefficient of relative variation is simple to compute, and particularly 
useful in comparative work, since it has the effect of norming for differences 
in absolute magnitudes and in substantive units of measure. It makes 
comparable sets of small and large values of the same kind, as well as 
values that are qualitatively different. 

The CRV is not applicable, however, unless (1) the observed measures 
have a true zero, and (2) the scale intervals are equal; in short, unless 
we have ratio scales. It is accordingly not to be used to gauge the relative 
variation, for example, in measures of social distance, intelligence, and 
attitudes. The reasoning is as follows. The CRV is designed to standardize 
for differences in location of absolute measures, or more specifically, for 
differences in absolute magnitudes of central values such as the mean or 
median. But when an absolute zero is nonexistent, we must necessarily 
assign arbitrary values to a series of observations. Thus, a social-distance 
measure of 30 may represent the same objective fact as a measure of 120, 
according to the scale system arbitrarily set up by the investigator. Such 
scales are not anchored to an empirical zero which represents the absence 
of that phenomenon. If, then, locational measures are arbitrary, any 
standardization or correction would be devoid of meaning. 
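
The difficulty with an arbitrary zero can be made concrete. The following is a minimal sketch, not part of the original text; the social-distance scores are hypothetical and the function names are our own. The same five observations are recorded on two scales that differ only in where the zero is placed, so that a score of 30 on one corresponds to 120 on the other.

```python
from statistics import median


def average_deviation(values):
    """Average arithmetic deviation from the median."""
    md = median(values)
    return sum(abs(x - md) for x in values) / len(values)


def crv(values):
    """Average deviation as a percentage of the median."""
    return average_deviation(values) / median(values) * 100


scale_one = [20, 25, 30, 35, 40]            # hypothetical scores, median 30
scale_two = [x + 90 for x in scale_one]     # same facts, zero moved by 90

# The absolute variation is untouched by the choice of zero ...
print(average_deviation(scale_one), average_deviation(scale_two))

# ... but the relative variation is not: roughly 20 per cent on one
# scale and roughly 5 per cent on the other.
print(crv(scale_one), crv(scale_two))
```

Since either zero point is as defensible as the other, neither percentage can be said to describe the relative variation of the phenomenon itself.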


Questions and Problems 

1. Define the following concepts: 

Arithmetic Deviation 
Algebraic Deviation 
Average Deviation (AD) 
Principle of Minimal Deviation 
Coefficient of Relative Variation (CRV) 
Absolute Zero 
Arbitrary Zero 

2. When is it more logical to base the AD on the median rather than the mean? 

3. Eleven houses are located on a street that runs due east and west. Which 
house is connected by the shortest aggregate distance to all others? Demonstrate 
graphically. How would the question be rephrased if there were ten houses? 



from what it does in relation to a base of $100. A variation of an inch in 
the length of the arm is no more than would be expected, but a variation 
of an inch in the length of the nose would be catastrophic. This principle 
is recognized and implemented in the coefficient of variation, which expresses 
the measure of variation as a percentage of its origin, be it mean or median. 
It thereby eliminates the extraneous factor of scale location. In the case 
of an AD based on the median, the formula would read: 

\[ CRV = \frac{AD_{Md}}{Md} \times 100 \tag{6.2.3} \]

If the AD is measured from the mean, we of course replace the median by 
the mean in Formula 6.2.3. 
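
Formula 6.2.3 can be applied directly to the hypothetical values of Table 6.2.1, for which the median is 10 and the average deviation from the median is 4.4. The following is a minimal sketch, not part of the original text; the function name `coefficient_of_relative_variation` and its `origin` parameter are our own.

```python
from statistics import mean, median


def coefficient_of_relative_variation(values, origin=median):
    """AD about the chosen origin, expressed as a percentage of it."""
    center = origin(values)
    ad = sum(abs(x - center) for x in values) / len(values)
    return ad / center * 100


values = [6, 8, 10, 15, 21]                 # Table 6.2.1, hypothetical data

crv_median = coefficient_of_relative_variation(values)         # about 44
crv_mean = coefficient_of_relative_variation(values, mean)     # about 40

print(round(crv_median, 1), round(crv_mean, 1))
```

Passing `mean` as the origin corresponds to replacing the median by the mean in the formula, as the text describes.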

Let us employ this procedure to compare the relative variation between 
homicide rates in the New England and the South Atlantic states (Table 
6.2.2). After comparing the raw AD’s of New England (.37) and the South 


Table 6.2.2  Computation of Average Deviation, Homicide Rates per 
             100,000 Population, New England and South Atlantic 
             States, 195… 

New England        Rate    |d|       South Atlantic      Rate     |d| 
Maine               .82    .10       Delaware            3.29    5.47 
Massachusetts       .83    .09       West Virginia       6.01    2.75 
Vermont             .85    .07       South Carolina      7.59    1.17 
Rhode Island        .93     …        Maryland            8.56     .20 
Connecticut        1.78    .86       Virginia            8.95     .19 
New Hampshire      1.95   1.03       Florida             9.97    1.21 
                     Σ = 2.21        North Carolina     11.19    2.43 
                                     Georgia            20.67   11.91 






4. Compute the AD from the median for the following set (N = 20): 


S 


0 

13 


(a) How would the AD be affected if each value were increased by 10? 

(b) How would the CRV be affected? 

5. Table 6.2.3 shows the percentage of each repertoire (e.g., Boston Orchestra, 
Chicago Orchestra, etc.) devoted to each of six leading composers. Calculate 
the median percentage for each of the composers, the average deviation based 
on the median, and the CRV for each composer. Rank the composers according 
to their CRV, and interpret. 


Talfe 6.2.3 Compoitr lirpTttmtation, OrcAcrfroI /?ep<Tfoirc*, 19iO-lD',^ 


CovrosiR 

OsCHtSTtlA 

Bol- 

ton 

Chi. 

nfo 

On. 

ft«ft 

Cfoe- 

land 

Mime. 

cpolie 

A'fir 

IVl 

PAiUar. 

Phila. 

drlpKfo 

St. 

LouU 

Baeh 

OS 

4,0 

7X> 

IS 

2,3 

2.0 

3.3 

■n 

Beethoven . 

OC 

100 

8.1 

11.0 

11.4 

102 

10.7 

BQ 

Berlioi 

2.5 

1.6 

1.1 

0.7 

3.6 

2.3 

O.C 


Brahms 

02 

0.4 

7.4 

112 

11.4 

08 

102 


Prokofiev . 

1.2 

12 

0.5 

1.1 

1.1 

1.1 

12 


Tschaikowsky 

45 

5.9 

72 

96 

7S 

54 

62 



Sourc* John >1 Mu»U»f, "Tho Mowumnenl of A»»th»lio Folkwajn," ,tii*«rw«a JoucmI 
a/ ;<or>a(.VV. t-I KMO, p STS (*itapl««t «t(h Oricinnl daU Irocn aulhor). 


Section Three 

Variation as Measured by Squared Deviations 

Principle of Least Squares. The measurement of variation by simple 
arithmetic deviations from a central value is a straightforward procedure. If 
a simple statement of dispersion were all that was wanted, the easily 
comprehended AD would serve quite well. However, variation may be, and 
is even more commonly, measured by squared deviations, which are 
universally taken from the mean of the series. The fundamental logic of this 
procedure is identical with that which argues for the simple deviations 
from the median: the principle of best fit. Just as the sum of the deviations 
from the median is minimal, so the sum of the squared deviations 
from the mean is also minimal. This is an exemplification of the principle 
of least squares, which is one of the most venerable and vital principles in 
all statistics, known and practiced for 150 years. 

The technique of squaring deviations may, at first glance, seem 
unnecessarily circuitous and superfluous. If variation can be satisfactorily 
measured by simple deviations, what additional information and insight 
can be gained by squaring them? A completely adequate answer to this 
question will be possible only at a later stage of the student's statistical 
studies. It must here suffice to state that the practical utility of such a 
measure is incomparably greater than that of the AD, which is less 
frequently called into use. The squared deviations may be expressed in 
several ways, each of which serves its own purpose: sum of squares (SS), 
variance (V), and standard deviation (SD). 

The Sum of Squares (SS). Just as we computed the sum of the arithmetic
deviations from the median and the mean in Table 6.2.1, we may now
compute the sum of the squared deviations from both averages (Table 6.3.1).
Although the sum of simple arithmetic deviations is minimal from the
median, we see in Table 6.3.1 that the sum of the squares of deviations is
minimal when these deviations are measured from the mean.

Table 6.3.1 Sum of Squares of Deviations from Mean and Median,
where X̄ = 12, Md = 10

   X     |X − X̄|    |X − Md|    (X − X̄)²    (X − Md)²
   6        6           4           36           16
   8        4           2           16            4
  10        2           0            4            0
  15        3           5            9           25
  21        9          11           81          121
  --       --          --     --------     --------
  60       24          22     SS = 146     SS = 166

Source: Hypothetical.

This result may seem paradoxical. Mathematical proof for this apparent
inconsistency cannot be offered here, but a partial explanation lies in the
fact that squaring is a geometric computation which has the effect of
weighting disproportionately the deviations as they increase in magnitude.
Since the mean equalizes the negative and positive deviations, it operates
to reduce to the limit the relative frequency of the large deviations, and
thereby minimizes the SS, usually written Σx².
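
The comparison in Table 6.3.1 can be checked in a few lines. The sketch below uses the five hypothetical values of that table; the function names are our own and serve only for illustration.

```python
# The five hypothetical values of Table 6.3.1.
values = [6, 8, 10, 15, 21]


def sum_abs_dev(xs, origin):
    """Sum of the simple (absolute) deviations from a chosen origin."""
    return sum(abs(x - origin) for x in xs)


def sum_sq_dev(xs, origin):
    """Sum of the squared deviations from a chosen origin."""
    return sum((x - origin) ** 2 for x in xs)


mean = sum(values) / len(values)              # 12.0
median = sorted(values)[len(values) // 2]     # 10 (middle item, N is odd)

# Simple deviations are least from the median ...
print(sum_abs_dev(values, mean), sum_abs_dev(values, median))
# ... while squared deviations are least from the mean.
print(sum_sq_dev(values, mean), sum_sq_dev(values, median))
```

Trying other origins (9, 11, 13, and so on) in place of the mean or median is a quick way to convince oneself that neither minimum can be beaten.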




Variance (V). Although the sum of squares is necessarily employed in
various statistical procedures, it is not a meaningful index of variation,
and for the same reason that the sum of the arithmetic deviations around
any average is not. A meaningful index of variation is obtained by taking
the mean of the sum of squares, and thereby eliminating the variable
factor of frequency. This result, called the variance, and most commonly
symbolized σ², corresponds procedurally to the average deviation. The
formula is:

$$\sigma^2 = \frac{\sum x^2}{N} \tag{6.3.1}$$

The Standard Deviation (SD), or Sigma (σ). Since the variance is derived
from the squared deviations, it is not a linear measure. If such a measure
is required, as it usually is, we have only to unsquare the variance in order
to restore the linear scale. In this form, it is known as the standard
deviation, or sigma, symbolized by SD, or the lower-case Greek σ.

$$\sigma = \sqrt{\frac{\sum x^2}{N}} \tag{6.3.2}$$

As such, it is used as a measure of variation quite analogous to the AD,
from which it differs primarily in the fact that the deviations were squared,
and their mean unsquared. However, the unsquaring does not cancel the
total effect of previous squaring; the weighting effect in part remains. If
it had been completely nullified, the result would have been the simple AD
from the mean.
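
That the weighting effect survives the unsquaring can be seen by computing both measures on the same series. The sketch below reuses the hypothetical values of Table 6.3.1; the variable names are ours.

```python
import math

values = [6, 8, 10, 15, 21]        # hypothetical series of Table 6.3.1
n = len(values)
mean = sum(values) / n             # 12.0

# Average deviation from the mean: signs ignored, no squaring.
ad = sum(abs(x - mean) for x in values) / n

# Standard deviation: deviations squared, averaged, then unsquared.
variance = sum((x - mean) ** 2 for x in values) / n
sd = math.sqrt(variance)

print(f"AD = {ad:.2f}, variance = {variance:.2f}, SD = {sd:.2f}")
```

The SD comes out larger than the AD because the largest deviation (9, for the value 21) contributes 81 of the 146 squared units, more than half, whereas it contributes only 9 of the 24 simple units.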


Table 6.3.2a Computation of Variance and Standard Deviation,
Homicide Rates, South Atlantic States, 1958

State                   X        x         x²
Delaware                3      −6.5      42.25
Florida                10       0.5        .25
Georgia                21      11.5     132.25
Maryland                8      −1.5       2.25
North Carolina         11       1.5       2.25
South Carolina          8      −1.5       2.25
Virginia                9      −0.5        .25
West Virginia           6      −3.5      12.25
                  -------     -----    -----------
                   Σ = 76       0.0    SS = 194.00
                  X̄ = 9.5

σ² = 194.00 / 8 = 24.25
SD = √V = √24.25 = 4.9
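
The worksheet of Table 6.3.2a translates directly into code. The following is a minimal sketch of the basic (definitional) routine, using the eight rates of the table; the dictionary and variable names are our own.

```python
import math

# Homicide rates from Table 6.3.2a.
rates = {
    "Delaware": 3, "Florida": 10, "Georgia": 21, "Maryland": 8,
    "North Carolina": 11, "South Carolina": 8, "Virginia": 9,
    "West Virginia": 6,
}

n = len(rates)
mean = sum(rates.values()) / n                       # X-bar

# Column x: deviation of each rate from the mean.
deviations = {state: x - mean for state, x in rates.items()}

# Column x squared, summed: the sum of squares (SS).
ss = sum(d ** 2 for d in deviations.values())

variance = ss / n                                    # formula 6.3.1
sd = math.sqrt(variance)                             # unsquared variance

for state, d in deviations.items():
    print(f"{state:<15} {rates[state]:>3} {d:>6.1f} {d ** 2:>8.2f}")
print(f"mean = {mean}, SS = {ss}, variance = {variance}, SD = {sd:.1f}")
```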



Computation of the Standard Deviation. (a) Ungrouped Data. In
principle, the standard deviation is little more complicated than is the
average deviation, involving only the additional steps of squaring the
deviations and unsquaring their mean. This is illustrated in Table 6.3.2a.

The calculation of the standard deviation on a large aggregate would
be extremely laborious if each deviation had to be individually measured
and squared. There are, however, computing formulas which greatly
simplify such calculations. These formulas all involve one or more coding
operations, similar to those employed in the case of the mean, although
these operations are masked from view by the terms of the formula. A
particularly expeditious method for finding the standard deviation of
ungrouped data performs all calculations on the raw observed values
themselves.

In the table below are presented the computing formulas that comprise
this method alongside the basic formulas in order to emphasize the
correspondence between them. To illustrate the computing routines, they are
applied to the homicide rates of the same eight South Atlantic states
(Table 6.3.2b).

Name                  Notation       Basic Formula                    Computing Formula

Sum of Squares        $N\sigma^2$    $\sum x^2$                       $\sum X^2 - \frac{(\sum X)^2}{N}$                                       (6.3.3)

Variance              $\sigma^2$     $\frac{\sum x^2}{N}$             $\frac{\sum X^2}{N} - \left(\frac{\sum X}{N}\right)^2$                  (6.3.4)

Standard Deviation    $\sigma$       $\sqrt{\frac{\sum x^2}{N}}$      $\sqrt{\frac{\sum X^2}{N} - \left(\frac{\sum X}{N}\right)^2}$           (6.3.5)


Table 6.3.2b Computation of Variance and Standard Deviation by Coding,
Homicide Rates, South Atlantic States, 1958


These yield exactly the same answer as that obtained by the basic formula.
While the calculation of the SD will thus normally be executed on the basis
of the computing routines, its interpretation will always be in terms of
the basic definitional formula.
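
The equivalence of the two routines is easily confirmed. The sketch below applies the raw-score computing formulas to the same eight homicide rates, working only with ΣX and ΣX²; the variable names are our own.

```python
import math

x = [3, 10, 21, 8, 11, 8, 9, 6]      # homicide rates, raw values only
n = len(x)

sum_x = sum(x)                        # sum of X
sum_x2 = sum(v * v for v in x)        # sum of X squared

# Computing formulas: no individual deviation is ever measured.
ss = sum_x2 - sum_x ** 2 / n
variance = sum_x2 / n - (sum_x / n) ** 2
sd = math.sqrt(variance)

print(f"sum X = {sum_x}, sum X^2 = {sum_x2}")
print(f"SS = {ss}, variance = {variance}, SD = {sd:.1f}")
```

One practical caution, which is ours and not the text's: when the values are very large relative to their spread, subtracting two nearly equal quantities in this way can lose precision on a computer, and the basic deviation formula is then the safer choice.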

(b) Grouped Data. To obtain the SD of grouped data, simple logic
would suggest that we measure the deviations of the class midpoints from
the mean and weight these according to the frequencies in the respective
intervals:

$$\sigma = \sqrt{\frac{\sum f x^2}{N}} \tag{6.3.6}$$

where f = class frequency
      x = deviation of class midpoint from mean

But these cumbersome arithmetic operations do not match the seductive
simplicity of the logic involved. Hence, we once again resort to coding
midpoints by interval deviations from the guessed mean, as was done in
the computation of the mean of grouped data (see Chapter 5, pp. 136-138).
The augmented worksheet requires only one additional column in order to
provide for the squared deviations (Column 5, Table 6.3.3). In the table
below, we have once again placed computing formulas alongside the basic
formulas to emphasize their equivalence, and in Table 6.3.3 we apply the
computing formula of the SD to the tabulation of suicide rates.


Name                  Notation       Basic Formula                      Computing Formula

Sum of Squares        $N\sigma^2$    $\sum f x^2$                       (6.3.7)

Variance              $\sigma^2$     $\frac{\sum f x^2}{N}$             (6.3.8)

Standard Deviation    $\sigma$       $\sqrt{\frac{\sum f x^2}{N}}$      (6.3.9)
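
The coding routine for grouped data may be sketched as follows. Two things here are assumptions of ours rather than material from the text: the frequency distribution is invented for illustration (it is not the suicide-rate tabulation), and the coded formula is written in one common notation, σ = i · √(Σfd²/N − (Σfd/N)²), where d is the deviation of a class midpoint from the guessed mean counted in class intervals and i is the interval width.

```python
import math

# Hypothetical grouped data, for illustration only.
midpoints = [2.5, 7.5, 12.5, 17.5, 22.5]   # class midpoints
freqs = [2, 5, 8, 4, 1]                    # class frequencies f
width = 5.0                                # class interval i
guessed_mean = 12.5                        # midpoint taken as arbitrary origin

n = sum(freqs)

# Coded deviation d: how many intervals each midpoint lies from the origin.
coded = [(m - guessed_mean) / width for m in midpoints]

sum_fd = sum(f * d for f, d in zip(freqs, coded))
sum_fd2 = sum(f * d * d for f, d in zip(freqs, coded))   # the added column

mean = guessed_mean + width * sum_fd / n
var_coded = width ** 2 * (sum_fd2 / n - (sum_fd / n) ** 2)
sd_coded = math.sqrt(var_coded)

# Check: the basic formula, deviations of midpoints from the true mean.
var_basic = sum(f * (m - mean) ** 2 for f, m in zip(freqs, midpoints)) / n

print(f"N = {n}, mean = {mean}, SD (coded) = {sd_coded:.2f}")
print(f"variance: coded = {var_coded:.4f}, basic = {var_basic:.4f}")
```

The two variances agree, which is the equivalence the table of formulas is meant to emphasize; the coded route simply keeps the arithmetic in small whole numbers.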


The Coefficient of Relative Variation (CRV). Like the AD, the standard
deviation may be converted into a measure of relative variation by
norming it on its own origin, namely the mean.

$$CRV = \frac{\sigma}{\bar{X}} \times 100 \tag{6.3.10}$$

Applying it illustratively to the suicide data, where X̄ = 12.5 and σ = 6.4,
we obtain:

$$CRV = \frac{6.4}{12.5} \times 100 = 51.2$$




Table 6.3.3 Computation of Standard Deviation by Coding, Grouped
Data, Suicide Rates, 107 Large U.S. Cities

As previously stated, such a measure is not likely to be used in isolation
from other comparable measures, except occasionally to indicate the
extent of scatter in relation to the magnitude of the mean. Thus, a small
sigma value relative to a large mean represents great homogeneity of the
data, and consequent typicality of the mean, which may be an important
piece of information under certain circumstances. A mean of $125, with
a σ of $5, giving a CRV of 4 per cent, is more representative of the array
than a mean of $125 with a σ of $25 and a CRV of 20 per cent. A CRV
of zero indicates no variation at all, and that the mean is utterly typical.
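
The coefficient is a one-line computation. The sketch below, with a function name of our own choosing, reproduces the dollar examples just given and the suicide-rate figure.

```python
def crv(sd, mean):
    """Coefficient of relative variation: SD as a percentage of the mean."""
    if mean == 0:
        raise ValueError("CRV is undefined when the mean is zero")
    return 100 * sd / mean


print(crv(5, 125))       # mean $125, sigma $5
print(crv(25, 125))      # mean $125, sigma $25
print(crv(6.4, 12.5))    # suicide data: mean 12.5, sigma 6.4
print(crv(0, 125))       # no variation at all
```

The guard against a zero mean is our addition; it reflects the point that the measure is meaningful only for scales with a true zero and a mean that serves as a sensible base.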

However, the CRV is more likely to be used in outright comparisons 
between series of related data. The relative variation in wages in the East, 
for example, may be less than in California. In the measure of public 
taste for a given composer, a high CRV would result from great disagree- 
ment; a low coefficient on the contrary would reflect a tendency toward 
consensus in taste. 

It is phenomena like these that the CRV is designed to measure in the
form of a simple, precise, and compact index. It should be stated here
that, to the extent that the SD exaggerates the variation around the mean,
the CRV correspondingly exaggerates the relative variation. But whether
the CRV is based on the SD or AD, it is of course inapplicable when the
observed measures have no absolute zero.

Characteristics of the Standard Deviation. For almost a century, the SD
has been one of the most powerful tools in statistical analysis, not only as
a measure of dispersion, but also as an ingredient in more complicated
computations. There are two interlocking characteristics of the sigma
which bear the credit for the efficacy of this tool. First, the standard
deviation reflects the magnitude of every variate of the series twice over:
(a) the point of origin from which the deviations are measured (X̄) is
itself an arithmetic reflection of the magnitudes of all the variates, and
(b) every magnitude, as magnitude, is represented arithmetically in the
squared deviations. Secondly, the squaring of the deviations constitutes
a mathematically legitimate method of clearing the signs. While the AD,
measured from the median, "ignores" the signs, the standard deviation
"clears" the signs mathematically; consequently, there is no breaking
of the chain of mathematical operations.

However, for descriptive and terminal purposes, the expedient of ignoring
the signs in the calculation of the AD is quite legitimate. Because it
measures the deviations directly, without squaring, it is smaller than the
SD, which inflates the deviations disproportionately by squaring.
However, as already implied, ignoring the signs disqualifies the AD,
irrespective of its origin, from participating in further algebraic
computations.

So “standard” has the standard deviation become that, by force of 
habit, it is employed as a simple, literal description of variation, even 
when the AD would be more faithful to the absolute deviations. This 
would be the case whether the deviations were measured from the median 
or mean, both of which are sanctioned by general practice. 

Questions and Problems 

1. Define the following concepts:

Principle of Least Squares
Sum of Squares
Variance
Standard Deviation
Basic Formula
Computing Formula

2. Explain why the standard deviation is ordinarily used as a measure of dispersion
instead of the variance.

3. Why is the SD larger than the AD?

4. Using small sets of illustrative data, verify that the sum of the squares is least
from the mean.

5. Is it possible for an array to have more than one SD?

6. Is it possible to calculate the SD of qualitative data?

7. From an examination of the basic formula for the SD, explain why
"unsquaring" does not completely nullify the effect of the previous squaring.

8. Compute the SD of the series given in Problem 4, p. 162. Add a constant
(e.g., 20) to each value and compute the SD of this transformed series. How
is the SD affected?

9. If the mean age of a group of college students is 20 years, with a standard
deviation of 2, what will be the mean and SD of that group 20 years later?
The CRV?

10. Compute the SD of the frequency distribution of delinquency rates
(Problem 11, p. 47) by the computing formula for grouped data.

11. Discuss the aptness of (a) the range, (b) the interquartile range, (c) the average
deviation, and (d) the standard deviation for purposes of describing the
dispersion of the following distribution (Figure 6.3.1):

Figure 6.3.1 Skewed Distribution



Section Four 

The Normal Distribution as a Pattern of Variation

The Concept of the Normal Frequency Distribution. One of the most
important applications of the SD lies in its description of the normal curve.
In fact, it may be claimed that as a measure of dispersion, the standard
deviation has meaning only insofar as the pattern of variation is normal.
Hence, an adequate discussion of the standard deviation must specify its
fundamental relation to the normal curve, the most illustrious of all
patterns of statistical variation.

The history of the normal curve dates back to 1733, when Abraham de
Moivre first established it in the course of his investigation into games of
chance. Later, in his more pious moments, he contended that it was a
manifestation of divine order in the universe.

Since that time, it has served various purposes. To astronomers, it
described the distribution of measurement errors around the "true" value;
hence, it was often characterized as the curve of error. The Belgian
statistician Quételet, during the 1830's, was the first to apply this curve to
social, psychological, and anthropometric data. His own concept, l'homme
moyen (the average man), which for him described the norm, flanked
by "nature's errors," is the logical basis for the now prevalent terminology
of "the normal distribution." The curve was also employed to represent
a sampling distribution of all possible sample values, which is today one
of its most significant uses, a topic which will be treated in a later chapter.

Quételet was of the opinion that social and moral data tend to array
themselves on the normal curve, which is thereby given the sanction of
nature. However, the "normal curve" no longer carries this eulogistic
connotation. Today it would be a mistake to consider non-normal
distributions as unusual or unnatural. The distribution of raw, empirical data
may naturally conform to any one of a number of curves. Nevertheless,
as a statistical model in fitting nature's variation, the normal curve is still
unrivaled in scope of application, notwithstanding the fact that it does not
possess the universality which Quételet attributed to it.

Characteristics of the Normal Curve. By definition, the ideal normal
distribution contains an infinite number of cases, is unimodal and symmetrical,
and unbounded at either end. Consequently, mean, median, and mode
are identical in value, and divide the array into two equal parts. The
graphic version of this distribution is a smooth, bell-shaped curve, with a
characteristic slope that never touches the base line. Starting at its peak,
the curve falls more and more rapidly up to the point of inflection, and
then gradually levels off, extending indefinitely in either direction. This
point of inflection is exactly one standard deviation distant from the mean
origin. Graphically, then, the standard deviation is the linear distance
along the base line from the mean to the ordinate defining the point of
inflection. This unit has been almost universally employed in measuring
distances from the mean since it was first approvingly launched by Karl
Pearson.

For every possible sigma distance from the mean, there is necessarily a
corresponding percentage of area, or frequency; hence, sigma points serve
as convenient measures of location. If we travel one sigma from the mean,
we leave behind 34 per cent of the items. But if we continue on to the
second sigma point, we would not have put behind 68 per cent of the items,
but rather only about 48 per cent, owing to the steady decline in density
as the distance from the mean becomes greater. Since the distribution is
symmetrical, approximately two-thirds (68.26 per cent) of the cases are
included within the interval that extends from 1 SD below to 1 SD above
the mean; 95 per cent (95.44 per cent) fall within 2 SD's on either side;
and practically 100 per cent (99.72 per cent) fall within 3 SD's on either
side of the mean.
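
These percentages need not be taken on faith. The sketch below computes the normal area between the mean and any sigma point from the error function in Python's standard library; the function name is our own. The results may differ from the figures quoted above in the second decimal place, presumably because the printed figures were obtained by doubling four-place table entries.

```python
import math


def area_mean_to_z(z):
    """Proportion of the normal area lying between the mean and z sigmas."""
    return 0.5 * math.erf(abs(z) / math.sqrt(2))


for k in (1, 2, 3):
    one_side = area_mean_to_z(k)
    print(f"{k} sigma: one side {100 * one_side:.2f} per cent, "
          f"both sides {200 * one_side:.2f} per cent")
```

Note that the area gained in passing from the first to the second sigma point is much smaller than the area within the first, which is the "steady decline in density" described above.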

Similarly, it is possible to determine the proportion of cases between the
mean and any other sigma value on the base line. Because this type of
information has so many applications, reference tables have been prepared
for the convenience of statistical workers. The most common of these
tables, the Table of Normal Areas, gives the proportion of the total area
or items under the curve between the mean and selected multiples of the
standard deviation. A systematic condensation of that table is reproduced
below. Like the more complete source, from which it has been taken, it
describes only one-half of the distribution, which suffices, of course, since
the distribution is symmetrical.

Table 6.4.1 Table of Normal Areas

Source: See Table I, Appendix, p. 418.

Examination of the table indicates, for example, that approximately
.4953 of the items lie within 2.6σ of the mean, so that practically 99 per
cent lie within 2.6σ on either side of the mean. Consequently, for all
practical purposes, the normal distribution has a range of six sigmas.
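
A condensed table of normal areas of the kind described can be regenerated with `statistics.NormalDist` from the Python standard library. The choice of sigma values (0.0 to 3.0 in steps of 0.2) is our own and is not necessarily that of the printed table.

```python
from statistics import NormalDist

std_normal = NormalDist()          # mean 0, standard deviation 1

# Area between the mean and each sigma value, to four places.
table = {
    k / 10: round(std_normal.cdf(k / 10) - 0.5, 4)
    for k in range(0, 31, 2)
}

print(" sigma    area")
for z, area in table.items():
    print(f"{z:6.1f}  {area:.4f}")

# The entry cited in the text: about .4953 on one side at 2.6 sigma,
# hence about 99 per cent on both sides together.
both_sides = 2 * table[2.6]
print(f"within 2.6 sigma on either side: {100 * both_sides:.1f} per cent")
```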

The Standard Deviate. The foregoing description of the normal curve is
an abstract one, given in terms of the standard deviation measured from
the mean as an origin. As such, the unit of measure is independent not
only of diverse measurement systems, but also of the concrete values
themselves. It makes no difference whether we are dealing with incomes of a
hundred or a million dollars, with durations of ten seconds or ten years,
or with varying intensity of attitude.

But sets of data always come to us as raw measures and give every
appearance of being non-comparable. We cannot readily compare, for
example, teachers' salaries and years of service, even though both variables
may be normal in their distributions. The solution to this problem of
comparability, of course, lies in converting the raw measures into sigma units
of measure, which are comparable. We express raw deviations from the
respective means as multiples of their standard deviations. Hence, such
measures are called standard deviates.* They are conventionally
symbolized z:

$$z = \frac{x}{\sigma} = \frac{X - \bar{X}}{\sigma} \tag{6.4.1}$$

By plotting absolute and sigma scales on the base line of a normal
frequency graph, it is possible to display visually the equivalence of raw
measures and standard deviates. It becomes plain, for example, that a
teacher's salary of $6,000 coincides with a standard deviate of +1.00; both
values represent the same objective fact. Moreover, by this device of
multiple scales, it is possible to exhibit conveniently the essential identity
between normally distributed variables which are seemingly very different.
Thus, the illustrative graph makes clear that a teacher's salary of $6,000
is statistically identical with 16 years of teaching service, both lying one
sigma from the mean.

This transformation to standard form, which at first may seem awkward,
will gradually become second nature to every student of statistics, since
it finds such a wide variety of application. Every student, whether or not
he is familiar with statistical processes, well knows that raw total scores
of, say, 300 and 500 in English and Mathematics, may or may not be
equivalent in relative grade value. But upon discovering both to be two sigmas
above the mean, the statistically trained student will correctly judge them
to be identical, since in both instances 98 per cent of the grades are
presumed to be lower. Thus, the z-measure serves to establish the relative
position of an item in a given array, and thereby renders corresponding
items in two or more normal arrays comparable.

Figure 6.4.2 Teacher Income and Years of Service as Sigma Units
(Hypothetical Data). Base-line scales: dollar income, 4000 to 7000 in
steps of 500; years of service, 0 to 24 in steps of 4.

* Synonyms include: standard measure, standard score, z-measure, relative deviate, and
sigma value.
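
The teacher illustration can be worked through numerically. The means and standard deviations below are not stated in the text; they are read off the two base-line scales of Figure 6.4.2 on the assumption that each scale runs from three sigmas below to three sigmas above the mean (income: mean $5,500, σ $500; service: mean 12 years, σ 4 years).

```python
from statistics import NormalDist


def standard_deviate(x, mean, sd):
    """Express a raw value as a multiple of the SD above or below the mean."""
    return (x - mean) / sd


# Assumed parameters, inferred from the scales of Figure 6.4.2.
z_salary = standard_deviate(6000, mean=5500, sd=500)
z_years = standard_deviate(16, mean=12, sd=4)
print(z_salary, z_years)          # both lie one sigma above their means

# Proportion of a normal array lying below a point two sigmas above the mean.
pct_below_2_sigma = 100 * NormalDist().cdf(2.0)
print(f"{pct_below_2_sigma:.1f} per cent lie below +2 sigma")
```

Once both variables are in sigma units, the comparison no longer depends on dollars or years; it depends only on the assumption that both arrays are approximately normal.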

Calculation of the Standard Deviate. To convert any series of values into
standard deviates, it is first necessary to compute the mean and the SD
of the series. Such computations are carried out on the data shown in
Table 6.4.2, which have purposely been curtailed for ease of
comprehension. The first variate is 1, which deviates by −3 from the mean value
of 4. Since the SD = 2, this variate deviates by −3/2, or −1.5σ. That is, the
value 1 translated into standard form is equal to −1.5. The remaining
standard measures are similarly calculated.

Table 6.4.2 Computation of Standard Deviates

These z-measures enable us to fix the relative position of each item in
the array by referring to the table of normal areas, or frequencies. Thus,
from that table we read that 1.5σ, measured from the mean, corresponds
to approximately 43 per cent of the total frequency, which leaves about
7 per cent of the items beyond that point. Although this latter figure cannot
be read directly from the table, it may readily be calculated by subtracting
the table entry from 50 per cent, as in the above example. It should be
emphasized that, although sigmas may be mechanically computed for any
distribution, whether normal or not, the foregoing interpretation of sigma
measures is valid only for the normal distribution.
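
The whole routine, from raw series to standard deviates to normal areas, may be sketched as follows. The five-item series is hypothetical: it was chosen by us only so that its first value is 1, its mean 4, and its SD 2, matching the figures quoted in the text, and it should not be taken as the content of Table 6.4.2.

```python
import math
from statistics import NormalDist

series = [1, 3, 4, 5, 7]            # hypothetical; mean 4, SD 2

n = len(series)
mean = sum(series) / n
sd = math.sqrt(sum((x - mean) ** 2 for x in series) / n)

# Standard deviate of every item: deviation from the mean in sigma units.
z_scores = [(x - mean) / sd for x in series]

std_normal = NormalDist()
for x, z in zip(series, z_scores):
    area = std_normal.cdf(abs(z)) - 0.5     # mean to z, as in the table
    tail = 0.5 - area                       # what lies beyond z
    print(f"X = {x}: z = {z:+.1f}, "
          f"area mean-to-z = {100 * area:.0f} per cent, "
          f"beyond = {100 * tail:.0f} per cent")
```

The areas printed are those of the normal curve, so with only five items they describe the model, not the series itself; this is the caution with which the paragraph above closes.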

Choice of Measure of Variation. The z-measure is not the only standard
measure in the most general sense of that term. Centiles, such as the
median and the quartiles, are also independent of the concrete data and
the measuring scales, and are therefore equally deserving of the title
"standard measure." In fact, since they are more generally applicable,
they are perhaps even more deserving of that appellation, because their
meaning does not depend on the form of the distribution, as is true of
z-measures. This is merely one of many instances of a generic term being
taken over in a specific setting and assigned a specialized meaning. We
therefore remind the student that each type of measure of variation has
its appropriate function, and that familiarity with both its statistical
characteristics and the substantive problem is necessary for an informed choice.

When the data are quantitative, the choice may depend on one or more
of the following overlapping criteria: (1) the objective of the inquiry;
(2) whether the measure is used as a terminal description or as a preliminary
to further computations; (3) the pattern of the distribution, whether
symmetrical or skewed; (4) the degree of completeness of the source data.

If our limited objective is only a quick impression of the scatter or
density, the range may be quite satisfactory. However, a more reliable
impression of the essential compactness of a distribution is conveyed by
the intermediate ranges, such as the interquartile range, or the 10-90 decile
range. These measures have the added versatility that their interpretation
is completely independent of any particular pattern of distribution. In
this sense, they are "distribution-free."
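
The three ranges can be compared on a small skewed series. The data below are invented for illustration, and the quartile and decile points are located by the default interpolation rule of `statistics.quantiles` in the Python standard library, which may differ slightly from the locating rule used elsewhere in this book.

```python
from statistics import quantiles

# Hypothetical, positively skewed series (one straggling high value).
data = [2, 4, 5, 7, 8, 9, 11, 12, 15, 18, 21, 40]

total_range = max(data) - min(data)

q1, _, q3 = quantiles(data, n=4)        # quartile points
interquartile_range = q3 - q1           # middle 50 per cent of the items

deciles = quantiles(data, n=10)         # nine decile points
decile_range = deciles[-1] - deciles[0] # middle 80 per cent (10-90 range)

print(f"total range         = {total_range}")
print(f"10-90 decile range  = {decile_range:.2f}")
print(f"interquartile range = {interquartile_range:.2f}")
```

The single extreme item (40) largely determines the total range but leaves the interquartile range untouched, which is why the intermediate ranges give the more reliable impression of compactness.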



All summary deviational measures, of course, express the amount of
deviation from a central value. Therefore, they are useful companion
pieces of information to the statement of the central value itself. If a
plain and simple measure of the volume of scatter around a central value
is desired, the average deviation recommends itself as the most forthright.
The standard deviation may be used for the same purpose, but it will not
serve as well: in the first place, it is less comprehensible because of its
more complicated derivation; and, secondly, the distortion produced by
squaring reduces its descriptive value. In spite of this, there are those
who adhere more or less indiscriminately to the standard deviation. This
convention seems to derive from the general prestige of the sigma, and
the habit of its employment for other purposes, rather than from a
scrupulous examination of its adequacy as a descriptive device.

However, the SD, and secondarily the AD, carry a supplementary
function of locating the relative position of an item within the distribution
with the aid of the table of normal areas. Obviously in such instances a
normal distribution must be assumed. If the pattern of distribution is not
at least approximately normal, one should consider the alternative
employment of centiles and other positional measures in the place of the sigma.
When, however, more advanced computations are anticipated, the sigma
is inescapable. It is used as a standard unit of measure in comparing
otherwise non-comparable data, and it is presupposed in correlation and
measures of sampling reliability.

To the extent that empirical sociological data are not normally
distributed, the sigma may possibly have less utility in sociology than in the
descriptive statistics of other disciplines; but this "handicap" does not
carry over into sampling, for reasons which will be made clear in
subsequent chapters.


Questions and Problems

1. Define the following concepts:

Normal Frequency Distribution
Normal Area
Normal Ordinate
Standard Deviate
z-Measure
Bell-Shaped Distribution

2. Explain in what sense the "normal" curve is normal.

3. A given variate lies one SD above its mean. Express this variate as a standard
deviate.

4. The range of a group of adult weights is 100 pounds; the range of their heights
is 12 inches. How might these ranges be made comparable? What data
would be required?






Section Five

Variation of Qualitative Variables 

Can QualiiaUze ranaJiOTi Be Mtotittedf The variation of qualitative 
variables cannot be measured in the same manner as that of quantitative 
data. Qualitative ^'ariables do not exist in magnitudes, and are not ranged 
on a continuum ^rith a lero origin; there is no central ralue, or total or 
mtermediate ranges. Hence, there are no devbtions or, of course, average 
de%'iations. 

But that is not to my that a group of qualitative items are necesarily 
identical, that there b only homogeneity and no heterogeneity. Two items 
differ when they do not possess the same attribute. Even though these 
differences are qualitative, it is still possible to de%’ise some mca.«ure which 
effectively summarizes them. But instead of measuring magnitudes, we 
count differences. 

Now it is a truism that the greater the number of differences among a 
set of items, the more heterogeneous is the aggregate, and consequently the 
more variation within it. Similarly, the smaller the number of differences, 
the greater the homogeneity within it, and the less the \-anation. There 
can be, for example, no sex difference at a stag party; but, in mixed 
company, there will alwaj*8 be a smaller or larger number of sex differences 
between indinduaU, depending on the sex ratio of that group. 1 1 appears 
reasonable, therefore, to base an index of qualitative \‘arution on the total 
number of differences among the items in the given set. It is only a ques- 
tion of (1) how to compute the total number of qualitative differences, 
and (2) bow to convert this total mto a meaningful index. 

To find the total number of differences, we count the differences between
each item and every other item and sum these observed differences. For
example, in a set of six boys and six girls, each of the six boys will differ in
attribute from each of the six girls, thereby making a total of 36 sex
differences. If there were nine boys and three girls, each of the nine boys
would differ from each of the three girls, producing 27 differences. In a
group of 12 boys, the obvious result of no differences would be obtained
by multiplying 12 by zero.

Evidently, the procedure for determining the total number of actual
differences reduces to the following rule: multiply every attribute
frequency by every other attribute frequency and sum these products. For
example, in a set of four Catholics, five Protestants, and six Jews, there
would be: (4 × 5) + (4 × 6) + (5 × 6) = 74 differences.
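
The counting rule can be sketched in a few lines of Python. This is only an illustration; the function name `total_differences` is our own and does not come from the text. It multiplies every attribute frequency by every other attribute frequency and sums the products, using the groups already mentioned above.

```python
from itertools import combinations


def total_differences(frequencies):
    """Sum the products of every pair of attribute frequencies."""
    return sum(a * b for a, b in combinations(frequencies, 2))


print(total_differences([6, 6]))     # six boys and six girls
print(total_differences([9, 3]))     # nine boys and three girls
print(total_differences([12]))       # twelve boys, a single attribute
print(total_differences([4, 5, 6]))  # Catholics, Protestants, Jews
```

With a single attribute there are no pairs to multiply, so the sum is zero, which matches the case of the 12 boys.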

Index of Qualitative Variation (IQV). However, these observed differences
take on meaning only in relation to the maximum possible number of




Variation


5. On the base line of a normal curve, sigma units are equal. Would this be the
case for a skewed curve? Do quartile points divide the range of a normal
distribution into equal intervals?

6. Plot a normal curve as follows: 

(a) From the zero origin placed at the midpoint of the horizontal axis, mark
off three arbitrary sigma units in both directions.

(b) Divide each sigma unit into tenths. 

(c) Above each such division point, plot the corresponding ordinate (Table II, 
Appendix, p. 419). 

(d) Draw a smooth curve through these ordinate points. 

(e) From the resultant figure, summarize what seem to be the main features
of the normal curve.

(f) Is it possible to determine visually from a graph, whether or not a dis- 
tribution is normal? 

7. Use Table I of the Appendix (p. 418) to determine the proportion of the normal
area lying between the mean and the following standard deviates:

±.67     ±1.96     ±2.33     ±3.00

±1.00    ±2.00     ±2.58

8. Find the proportion of cases between each pair of sigma points on the base
line of a normal curve. Represent results graphically.

0.3 to 1.6      1.1 to 1.2     -2.58 to +2.58

0.3 to -1.6     0.0 to 1.0     1.5 to 3.0

.1 to .2        1.0 to 2.0     -2.3 to +2.3

9. Between what two sigma points on the base line of the normal curve do the
middle 50 per cent of the cases lie?

10. Explain why the proportion of cases between zero and 1.0 sigma is not equal
to the proportion between 1.0 and 2.0.

11. The mean of a normal distribution is 75 and the standard deviation is 3.

(a) What proportion of values lies between 72 and 78?

(b) What values are in the upper tenth of the distribution?

(c) Approximately what proportion lies between 69 and 81?

12. Of the incomes in a normal distribution, 20 per cent are below $50 and 30 per
cent are above $60. Determine the standard deviation and the mean of the
distribution. (Hint: First express $50 and $60 as standard deviates.)

13. Prepare a "less than" cumulation of suicide frequencies from Table S.l.ld
and plot the corresponding CFP. Using the mean of 12.5 and the SD of 5.4,
determine (a) the proportion of suicide rates between the mean and a point
one SD above the mean; (b) the proportion between X and a point one SD
below; (c) the proportion between points one sigma above and below X,
respectively. What does this result demonstrate as regards the normality of the
frequency distribution of suicide rates?



In Evansville, Indiana, there were roughly 120,000 whites and 9,000 
Negroes, so that: 


$$\text{IQV} = \frac{120 \times 9}{64.5 \times 64.5} \times 100 = \frac{1{,}080}{4{,}160} \times 100 = 26\%$$


Thus, as gauged by the IQV, there is twice as much racial heterogeneity 
in Indianapolis, which has a northerly location, as in the border city of 
Evansville. 

An examination of the arithmetic of the formula will disclose that when 
observed frequencies are expressed as percentages, which are sometimes 
more convenient, exactly the same results will be obtained. Thus, for 
Indianapolis, the equation would read: 


$$\text{IQV} = \frac{85 \times 15}{50 \times 50} \times 100 = 51\%$$
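
The claim that counts and percentages give the same index can be checked numerically. The sketch below is a minimal illustration, not the authors' procedure: the function name `iqv` is hypothetical, and the percentages are computed unrounded from the counts quoted in the text, since rounding them to 85 and 15 shifts the result slightly.

```python
import math
from itertools import combinations


def iqv(frequencies):
    """Index of Qualitative Variation, in per cent, from attribute frequencies."""
    k = len(frequencies)
    if k < 2 or sum(frequencies) <= 0:
        raise ValueError("need at least two attributes and a positive total")
    observed = sum(a * b for a, b in combinations(frequencies, 2))
    mean_frequency = sum(frequencies) / k
    possible_pairs = k * (k - 1) // 2
    maximum = possible_pairs * mean_frequency ** 2
    return 100 * observed / maximum


counts = [363_000, 64_000]                       # Indianapolis, from the text
total = sum(counts)
percentages = [100 * f / total for f in counts]  # unrounded percentages

print(round(iqv(counts)), round(iqv(percentages)))  # the two agree
print(round(iqv([120_000, 9_000])))                 # Evansville, from the text
```

The index is unchanged because multiplying every frequency by the same constant multiplies the numerator and the denominator by the square of that constant.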


Qualitative variation, as here defined, is a strictly statistical characteristic, 
and should not be confused with the socio-psychological state which
characterizes social disorganization, anomie, or social conflict. The degree of
social disorganization may, of course, be related to the degree of statistical 
heterogeneity in regard to race, religion, ethnic background, or nativity, 
for this heterogeneity may be one of the conditioning factors in the attitudes 
of the population. Thus, it has been hypothesized that social tension in- 
creases as conflict groups approach equality in power, of which numerical 
parity is one element. The IQV is one tool for its measurement, and en- 
ables us to study more systematically such hypotheses. 


Questions and Problems 

1. Define the following concepts:

Index of Qualitative Variation
Maximum Possible Differences

2. A given population is 50 per cent male and 50 per cent female, and 70 per cent
white and 30 per cent Negro. Is it possible to represent both variables by a
single IQV? Comment.

3. Is the IQV a standardized measure? Explain.

4. Is it possible for a group of persons to have more than one IQV? Explain.

5. At one college, 80 per cent of the students are men; at another, 67 per cent.
Compute and compare IQV's.




differences. This maximum number occurs when all of the frequencies
of the individual attributes in the set are equal. Therefore the
hypothetical maximum may be obtained by equalizing the frequencies (i.e.,
obtaining the mean frequency), pairing these equalized frequencies and
computing their products, and then obtaining the sum of all such products.
In short, we (1) find the mean frequency, (2) square this result, and
(3) take this square as many times as there are possible pairs of attributes.
In the aforementioned example of nine boys and three girls, the maximum
possible number of sex differences in a group of 12 would be
6 (boys) × 6 (girls) = 36, or, in this special case, the mean frequency
multiplied by itself.

The relative amount of variation may now be measured by the ratio 
between the observed number of differences and the hypothetical maximum, 
expressed as a percentage: 


$$\text{Index of Qualitative Variation} = \frac{\text{Total Observed Differences}}{\text{Maximum Possible Differences}} \times 100 \tag{6.5.1}$$


In the above instance of 9 boys and 3 girls:

$$\text{IQV} = \frac{27}{36} \times 100 = 75\%$$

Among the 15 members of the three religious groups alluded to above, 
the mean number of members is, of course, five. Multiplying each "5" by
every other "5" and summing these three products, we find the maximum
number of differences to be 75. The observed differences, as already
calculated, equal 74. Hence,

$$\text{IQV} = \frac{74}{75} \times 100 = 99\%$$

This index will always vary between zero and 100 per cent. If the numerator
is zero, the index will likewise be zero, and will reflect the complete absence
of variation. In the event of an equal division of observed frequencies
of attributes, the numerator and denominator will be identical, and the
index will be 100 per cent, reflecting maximum heterogeneity, or variation.
Intermediate degrees of heterogeneity will take on intermediate index values.
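
For readers who prefer symbols, Formula 6.5.1 can be written out for any number of attributes. The notation below is our own assumption and does not appear in the text: there are k attributes with frequencies f_1, ..., f_k, and N items in all.

```latex
% Assumed notation (not in the original): k attributes with frequencies
% f_1, ..., f_k, and N = f_1 + ... + f_k items in all.
\[
\text{Observed differences} = \sum_{i<j} f_i f_j ,
\qquad
\text{Maximum differences} = \frac{k(k-1)}{2}\left(\frac{N}{k}\right)^{2}
\]
\[
\text{IQV} = \frac{\sum_{i<j} f_i f_j}
                  {\dfrac{k(k-1)}{2}\left(\dfrac{N}{k}\right)^{2}} \times 100
\]
% Check with the three religious groups: k = 3, N = 15, mean frequency 5,
% maximum = 3 \times 5^{2} = 75, observed = 74, IQV = 74/75 \times 100 \approx 99.
```

The factor k(k-1)/2 is the number of possible pairs of attributes, and N/k is the mean frequency, so the denominator is the squared mean frequency taken once for each pair.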


Use of the Index. This index can be used to compare, for example, the
relative amount of racial homogeneity in two or more communities. In
Indianapolis there were in 1950 approximately 363,000 whites and 64,000
Negroes. Therefore:

$$\text{IQV} = \frac{363 \times 64}{213.5 \times 213.5} \times 100 = 51\%$$





Norming Operations 


Section One
Elementary Norming

The Need for Norming. A person does not ordinarily respond to an event
as a raw, isolated fact, but reacts to it rather in terms of some norm,
whether expressed or implied, in his social background. The individual
observer thereby supplies the mental backdrop against which the event is
judged or interpreted. For example, an annual income of $5,000 is not
perceived as a detached item, but rather in relation to a standard based
on the experience of the observer; 100 births in a community in a given
year have meaning only in terms of a comparison with such relevant data
as the size of the population, the number of births of the preceding year,
or the number of births in comparable communities. If the standard
against which the comparison is made is not expressly stated, the observer
will unwittingly provide one himself in accordance with his own previous
experience. There is, of course, the possibility that the individual will do
this inexpertly, or will use some norm other than the one intended by the
investigator who compiled the data, and thus may extract meanings from
them that are inappropriate.

The difficulties will be still greater when comparisons are made between
two or more sets of values based on populations that are unlike in significant
respects. For example, a comparison of the intelligence of Negroes and
whites in the United States on the basis of mental tests would be spurious
and misleading, since performance on such tests is known to improve with
level of education, which is higher among whites. Likewise, an evaluation
of two or more colleges, based solely on a comparison of the average
incomes of the respective graduates, would be of doubtful validity. A
person's income is affected not only by the quality of his college training, but



6. A population is distributed into four ethnic groups as follows:

German       60%
French       20
Swedish      15
Irish         5
            100%

Calculate the IQV.





§ 7.1 Elementary Norming


cephalic index is a whole-whole ratio between two cranial measures,
breadth and length, for the purpose of distinguishing quantitatively the
round and oval head shapes. Such ratios are multiplied by 100 to clear
the decimal point.

$$\text{Cephalic Index} = \frac{\text{Breadth}}{\text{Length}} \times 100$$

The familiar Intelligence Quotient (IQ) is a ratio between the mental and
the chronological age of the tested person, which permits persons of various
ages to be compared on intelligence. Thus:

$$\text{IQ} = \frac{\text{Mental Age}}{\text{Chronological Age}} \times 100$$

It is 100 when the chronological age and mental age are exactly equal.
Other common ratios employed in socio-economic analysis include the
person-room, population-land, and child-adult ratios.

Arithmetically speaking, ratio is a generic term, indicating a comparison
by division, implied or calculated. Therefore, any person may construct
any ratio his needs may suggest. However, the above examples are
distinguishable by their conventionality and practicality; these quotients
as well as many others are widely understood and employed. They are
not capricious comparisons for personal use; they are established devices
which are used for the transmission of information, and they therefore
take on the characteristic of a socially accepted norm.


Rates. A rate is essentially an arithmetic mean. It is the average number
of occurrences of one variable per unit of another. Thus, a rate of 20
miles per gallon is the mean consumption of fuel, in which the mileage
would almost certainly vary from one gallon to another. The "batting
average" could with equal logic be labeled a "batting rate" if it were not
for the inertia of linguistic habits. A batting average of .300 could be
interpreted to mean three-tenths of a hit per trip to the plate, 30 hits per
one hundred trips, or 300 hits per 1,000. Since all rates are based on past
observations, they provide a theoretical expectation for the future. Hence,
certain calculated rates and averages are sometimes referred to as expected
values.

Being essentially a condensation of a large number of observations, the
rate has proved to be an effective measure for social analysis. A large
number of rates have become conventional in the field of sociology: the
marriage rate, the crime rate, birth and death rates, and innumerable
modifications of these rates, which are tools in every sociological workshop.

Basically, a rate is a statistical compound of two variables: the problem
variable and the norming variable. In the construction of rates, the selection
of the norming variable, to which the problem variable will be related, is



also by his ability and "connections," traits that are not uniformly
distributed among college populations.

One of the important functions of the statistical method is to furnish
the techniques by which single quantities may be properly interpreted,
or two or more meaningfully compared. There are many devices by which
these objectives are accomplished, several of which are assembled and
discussed in this chapter. Collectively, they belong to a family of
procedures which we may call norming operations, since they set up an
appropriate statistical norm in terms of which the raw data are expressed and
thereby rendered comparable.

The process of norming is already familiar to the student from his
experience with standard (sigma) deviates, whereby diverse variables
(income, age, and so on) are made comparable by expressing each as a relative
distance from its own mean. Similarly, the coefficient of relative variation
renders two or more standard deviations comparable to one another by
expressing each as a percentage of its mean. Here we shall analyze certain
other forms of this important type of statistical reasoning as they apply
to sociological materials. In rough order of complexity, we may norm
by: (1) percentages, (2) ratios, (3) rates, (4) indexes, (5) subclassification,
and (6) standardization.

Percentages. The simplest form of norming is the reduction of a series of
absolute figures to a standard numerical base compatible with the habits
of thought established by our decimal system. Cumbersome and confusing
absolute figures are therefore customarily expressed as so many per
hundred ("per cent"), per thousand ("per mill"), per million, or in some
other multiple of ten. Instead of quoting the enrollment of men and women
college students as 8,244 and 4,122, we convert these values into 66.7 and
33.3 per cent respectively. So universal has this elementary practice
become, that the fundamental principle on which it rests is not fully
appreciated by the layman unless circumstance compels him to examine the
logical foundations of his habits, as when the traveler is forced to convert
miles to kilometers, dollars to pounds, meters to yards.

Ratios. Two values may be compared in ratio form, or the one expressed
as a multiple of the other. Ratios vary in composition: they may be part- 
part ratios of frequencies within the same set, or whole-whole ratios be- 
tween the frequencies of two selected variables. 

Thus, the sex ratio may be viewed as a part-part ratio; it compares the
number of males in the population to the number of females. In 1950
there were 74,200,085 males in the United States, and 76,010,025 females.
The ratio between these bulky numbers is much more easily retained when
it is expressed in the form of 97 males to 100 females, or simply 97, as it
would be conventionally quoted. In the field of anthropometry, the


should logically consist of those individuals that could possess the problem
attribute, collectively known as the "exposed" group. While death may
occur to anyone in the population, births, marriage, and divorce may not.
Hence, a rate based on a judiciously selected exposed population is less
subject to distortion by extraneous factors. Birth rates, marriage rates,
and divorce rates calculated on a total, unselected population are therefore
usually referred to as "crude" rates; others are labeled "specific"
or "refined." The advantage of the crude rate lies in its ready
comprehensibility, the economy of tradition, and its utility for rough
unspecialized purposes. The advantage of the specific rate lies in its
precision and its serviceability in technical and professional research.

The Index. In statistics, the index, a term which is colloquially as well as
technically used for various types of measures, usually pertains to relatively
complicated ratios or sets of ratios. As a derived measure, it is designed
to express simply the variation in a given set of values which in the raw
form would be quite unintelligible. In its more formalized version, it
usually refers to a ratio between two values, one of which is taken as the
norm, or an expectation, against which the other is measured.

Thus, the cost-of-living index compares the average of prices in a
particular year with the average for the "normal" year. An index of 130 in
1955, on a base year of 1949, indicates that the cost of living has increased
30 per cent over the base year taken as 100. While such a deceptively
simple index may be glibly quoted by every columnist, nevertheless its
internal composition (coverage, weighting, method of averaging of
observations) as well as the choice of the base period all testify to its
statistical complexity. Analogously, the IQV (Chapter 6) compares the observed
number of differences among attributes in a given set with the maximum
possible number of differences, which in this case serves as a norm.

Table 7.1.1 illustrates the construction and use of an index for the
measurement of the social stratification of university students, and the
under- and over-representation of the various social classes. The logic
of the indexes constructed in this tabulation is as follows: daughters of
professional parents comprise 19.3 per cent of the total female student
body, whereas in the state of Indiana, the professional group itself comprises
only 4.7 per cent. The relationship between these paired percentages
would measure the differential class opportunity. There are two procedures
by which we could measure the differentials: (1) by the simple
discrepancies in percentage points; or (2) by the normed indexes.

In respect to the first alternative, an examination of the discrepancies
reveals a progressive change from excess to deficit, as we proceed down the
social scale. However, these simple differences are absolute differences,
in that they are not normed for magnitude or origin. Just as a loss of $5
from $1,000 represents a smaller relative discrepancy than a loss of $5 from



critically important. For example, in computing the birth rate, it is
necessary to decide on the choice of the norming population to which the
absolute number of births is to be compared. This could be the total
population, the number of women of child-bearing age, or the number of
married women of child-bearing age. The most commonly quoted birth
rate (though it is not the most discriminating) is the crude rate based
on the total population: men, women, and children.

A second consideration in the construction of a rate involves the choice
of a standard numerical base: 10, 100, 1,000 or multiples thereof. The
function of the numerical base is merely to clear the decimal places for
tabular convenience and facilitate quotation and quick understanding.
However trivial this operation may seem, the numerical base is often fixed
by convention, and one has no choice other than to comply with prevailing
usage. Thus, a birth rate, quoted as 24, would be internationally
understood to mean 24 births per 1,000 general population in a given year and
territory. It is more readily perceived, and more securely retained, than
768 out of 32,462. Its calculation would have been as follows:

Number of births = 768

Total population = 32,462

Numerical base = 1,000

$$\text{Birth Rate} = \frac{768}{32{,}462} \times 1{,}000 = 24$$


The generalized formula would therefore read:

$$\text{Rate} = \frac{\text{Frequency of Problem Variable}}{\text{Frequency of Norming Variable}} \times \text{Numerical Base}$$

In notation:

$$\text{Rate} = \frac{PF}{NF} \times NB \tag{7.1.1}$$

where PF = problem frequency
NF = norming frequency
NB = numerical base

Occasionally, neither the norming variable nor the numerical base is as
solidly founded in convention as is the case with general birth and death
rates. In such instances, a certain discretion is permitted, the outcome of
which must then be specified by the compiler. Divorce rates may be
calculated on the general population, or the number of marriages in the
same year and area, or even on the number of marriages during the
preceding ten years, which account for most of the divorces. Delinquency rates
may be calculated for specific age and sex groups; marriage rates on the
age group of 14 years and over.

From previous illustrations, we may discern that the norming variable 






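Table 7.1.1  Computation of Index, Socio-Economic Classification,
Women Students and State Population, Indiana University
(1950) and State of Indiana (1940)

                          Indiana University   Indiana 1940    Diff.
Socio-Economic Class          N     Per Cent     Per Cent    % Points   Index

Professional                408      19.3%         4.7%        14.6      411
Proprietors, Managers       507      24.0          4.7         19.3       -
[illegible]                 290      13.7          5.0          8.7       -
Clerks                      286      13.5         12.8           .7       -
Farmers                     196       9.3         15.9         -6.6       -
Skilled and Foremen         267      12.6         17.0         -4.4       -
Semi-skilled                 94       4.5         19.9        -15.4       -
Unskilled                    65       3.1         20.0        -16.9       -

Total                     2,113     100.0%

A dash in the Index column marks an entry that is not legible in this copy.

Source: Kate H. Mueller and John H. Mueller, "Class Structure and Academic
Success," Educational and Psychological Measurement, XIII, 1953, p. 488.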


$10, so the differences between the percentage points must not be
interpreted uniformly without reference to the bases from which they are
calculated.

This need for a normed evaluation is satisfied by the index, the second
proposed alternative. Thus, if attendance were evenly distributed among
all the classes of the population, one would of course "expect" the
professional group, which amounts to 4.7 per cent of the general population, to
supply 4.7 per cent of the student body. The expected proportion of
students and the observed proportion would in that case be identical, and the
ratio between them would be unity, and set equal to 100. In actuality,
the professional segment of the university student body is 19.3 per cent,
which is $\frac{19.3}{4.7} \times 100 = 411$ per cent of the corresponding state percentage,
or 4.11 times the expected value. In like manner, we norm all other
socio-economic percentages on their own respective proportion. A glance down
the column of indexes will immediately and accurately attest to the
progressively declining representation of the respective social classes as we
proceed from high to low. This is an effective consolidation of a large and
somewhat complicated array of data in the general field of social
stratification.
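
The two alternatives, simple differences in percentage points and normed indexes, can be set side by side in a brief Python sketch. This is only an illustration: the function name `representation_index` is hypothetical, the percentages are those of Table 7.1.1, and the indexes it prints are computed here from those rounded percentages rather than copied from the table's own index column.

```python
# Percentages from Table 7.1.1. The 13.7 / 5.0 row, whose class label
# does not survive in the table, is left out.
university_pct = {
    "Professional": 19.3,
    "Proprietors, Managers": 24.0,
    "Clerks": 13.5,
    "Farmers": 9.3,
    "Skilled and Foremen": 12.6,
    "Semi-skilled": 4.5,
    "Unskilled": 3.1,
}
state_pct = {
    "Professional": 4.7,
    "Proprietors, Managers": 4.7,
    "Clerks": 12.8,
    "Farmers": 15.9,
    "Skilled and Foremen": 17.0,
    "Semi-skilled": 19.9,
    "Unskilled": 20.0,
}


def representation_index(observed_pct, expected_pct):
    """Observed percentage as a per cent of the expected percentage."""
    return observed_pct / expected_pct * 100


for social_class, observed in university_pct.items():
    expected = state_pct[social_class]
    difference = observed - expected                      # percentage points
    index = representation_index(observed, expected)      # normed index
    print(f"{social_class:<22} {difference:+6.1f} {index:6.0f}")
```

An index above 100 indicates over-representation in the student body and an index below 100 under-representation, whatever the size of the class in the state.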

Another approach to the formulation of an index would be the norming
of a series of values on the mean of the series. The index, in that case,
would be simply computed as the ratio between a given value and the mean






of the series. Assume, for example, that the mean death rate of a set of 
cities is 10.5. If all factors operating in the production of death rates in 
the respective cities were uniform, all the death rates would of course be 
identical, and therefore equal to the mean of the series. We state that the 
mean is the expected, or theoretical, value under these conditions. Hence, 
to measure the force of the differential factors, the individual rates are 
measured against the mean. For example, if a city has an observed death 
rate of 7, we would compute the index as follows:

$$\text{Index} = \frac{\text{Observed Rate}}{\text{Expected Rate}} \times 100 = \frac{7}{10.5} \times 100 = 67\% \tag{7.1.2}$$


This means that the given city has a death rate that is 67 per cent of the
grand average. By this technique, any city could at once locate itself on
the scale relative to the general mean. This device of norming is analogous
to the CRV in that it expresses the raw value as a multiple of its own mean.
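
Norming a series on its own mean can be sketched as follows. The function names are our own, and the three-city series is hypothetical, invented only so that its mean equals the 10.5 used in the text; the single-city figure of 7 against a mean of 10.5 is the example of Formula 7.1.2.

```python
def index_on_mean(value, mean):
    """A value expressed as a per cent of the mean, as in Formula 7.1.2."""
    return value / mean * 100


def indexes_for_series(rates):
    """Index every rate in a series against the mean of that series."""
    mean = sum(rates) / len(rates)
    return [index_on_mean(r, mean) for r in rates]


# The example from the text: an observed rate of 7 against a mean of 10.5.
print(round(index_on_mean(7, 10.5)))

# Hypothetical death rates for three cities (illustrative values only).
hypothetical_rates = [7.0, 10.5, 14.0]
print([round(i) for i in indexes_for_series(hypothetical_rates)])
```

A city at the mean receives an index of 100, so each index shows at a glance how far, and in which direction, a city departs from the general average.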

The seasonal index, which is not elaborated here, is similarly calculated
as a ratio between a given monthly measure and the average measure for
the twelve-month series, and is used to measure the fluctuations in births,
deaths, production, and certain other economic indices.


Questions and Problems

1. Define the following concepts:

Norming Operation
Ratio
Percentage
Rate
Problem Variable
Norming Variable
Numerical Base
Exposed Group
Index
Expected Value

2. In 1930, Indianapolis had a population of about 400,000, and the number of
births in that year was 7,672. What was the birth rate per 1,000?

3. In the eighteenth century, European churches maintained records of births
and deaths, which were later used by population scholars to estimate the
sizes of cities. Thus, in Berlin, about 1700, there were 178 deaths. The
estimated ratio of deaths to total population was 1:35. Calculate the death rate
per 1,000. Estimate the size of the city of Berlin of that period.

4. Between 1925-1935, the birth rate dropped from 25 to 19. Calculate the
permillage-point decrease (the decrease per 1,000); the percentage decrease.







§ 7.2 Subclassification and Standardization

5. In a city of 500,000, the sex ratio is 92. Calculate the number of males. 

6. If a college has an enrollment of 1,236, of which 692 are males, what is the sex
ratio for the school? Explain briefly what this ratio means.

7. Suicide rates are calculated on a numerical base of 100,000. In the years
1945-1952 there were 1,055 attempted suicides in Seattle (King County); of
these 468 were completed, males completing 353 and females 115. The sex
ratio is 100 in Seattle's population of 732,992. What is the annual suicide
rate of each of the following: males, females, and total? Calculate the ratio
of attempted to successful suicides.

8. In 1939, coffee cost 22.3 cents per pound. In 1949, coffee was 52.4 cents per
pound, and in 1957 its price had risen to 79 cents per pound. If 1939 is
considered the base year, what would be the price index for 1949 and 1957?
Interpret your answer. (Source: U.S. Bureau of Labor Statistics.)

9. The sex ratio in City A is 100, in City B, 50. The total population in the two
cities is identical. Does City A have twice as many men as City B? Explain.

10. (a) Compute indexes of Negro and white participation in relief for each city
(Table 7.1.2).

(b) Array Negro indexes for Northern and Southern cities separately; calculate
the median of each array and compare.


Section Two 

Subclassification and Standardization

Norming by Subclassification. Just as a single absolute value is devoid of
meaning unless related to an appropriate norm, likewise a single rate or
percentage possesses little if any sociological meaning when standing alone.
It acquires meaning only when placed side by side with analogous rates for
purposes of comparison. Accordingly, we compare the birth rate of one
state with that of another, the suicide rates of Negroes and whites, the
marriage rates of Catholics and Protestants. Such comparisons presumably
reflect differences in fertility by state, in propensity to suicide by race, and
in proneness to marriage by religion. However, the natural inference that
there is an association, or even a cause-and-effect relation, between such
paired variables should not be too hastily made, because the observed
variation among such rates may be due at least in part to the operation of
factors other than those explicitly identified in the classification, but which
still influence the events under observation. Such factors are therefore
labeled concealed factors, for they do their work invisibly in the tabulation
before us. Many comparisons between two or more sets of observations
are disturbed by the operation of such concealed factors, which distort
the differences and lead to false inferences. Thus, the higher birth rate in
one state may be due not to the greater fertility of its population, but rather
to its larger percentage of women of child-bearing ages, an adventitious
factor which is not manifest in the quoted crude rate.


Table 7.1.2   Families on Relief, White and Negro, Selected Cities of U.S., 1934

                         Relief Population 1934            Census 1930
City                     Number     % White  % Negro      Number      % White  % Negro

Akron, Ohio                  8,565    80.2     19.7          62,557     96.0      3.9
Ansonia, Conn.                 632    83.1     15.9           4,602     94.1      5.8
Atlanta, Ga.                18,718    38.5     61.5          67,749     65.4     34.6
Baltimore, Md.              40,880    56.1     43.7         193,991     82.8     17.1
Benton Harbor, Mich.           819    80.5     19.4           4,133     93.8      6.0
Biloxi, Miss.                  ...     ...      ...           3,645     80.2     19.7
Birmingham, Ala.               ...     ...      ...          64,263     58.9     41.1
Bowling Green, Ky.             ...     ...      ...           3,332     78.2     21.8
Charleston, S.C.               ...     ...      ...          16,693     46.8     53.1
Charlotte, N.C.                ...     ...      ...          19,243     66.7     33.3
Chicago, Ill.                  ...     ...      ...         842,578     92.9      6.5
Cincinnati, Ohio               ...     ...      ...         122,511     89.6     10.3
Cleveland, Ohio                ...     ...      ...         221,602     91.9      7.9
Detroit, Mich.                 ...     ...      ...         370,293     92.6      6.9
Evansville, Ind.               ...     ...      ...          25,716     93.3      6.7
Gastonia, N.C.                 ...     ...      ...           3,697     78.2     21.8
Houston, Tex.                  ...     ...      ...          75,408     73.7     22.5
Indianapolis, Ind.             ...     ...      ...          98,610     87.8     12.1
Jackson, Miss.                 ...     ...      ...          11,065     56.6     43.4
Kansas City, Mo.               ...     ...      ...         108,641     88.9     10.5
Lake Charles, La.              ...     ...      ...           3,834     61.6      ...
Lakeland, Fla.                 ...     ...      ...           5,040     78.7     21.3
Lexington, Ky.                 ...     ...      ...          12,026     67.7     32.3
Little Rock, Ark.              ...     ...      ...          20,026     74.3     25.7
Los Angeles, Calif.            ...     ...      ...         368,508     90.3      3.0
New Orleans, La.               ...     ...      ...         111,936     68.9     30.8
New York, N.Y.                 ...     ...      ...       1,722,954     95.3      4.5
Norfolk, Va.                 3,750     ...      ...          31,859     63.7     36.0
Pittsburgh, Pa.             44,996     ...      ...         155,079     91.6      8.3

Urban United States      2,019,949     ...      ...      17,372,524     91.3      7.6

Source: Original data, unpublished.

An illustrative test of that possibility is provided by an analysis of the
crude over-all birth rates of metropolitan California and rural Kansas,
each quoted at 14.8 in 1940. At first glance, this identity of rates is rather
startling, since rural populations are known to be more fertile than urban
areas. We may suspect that the population of California has a significantly
larger proportion of women in the child-bearing ages than that of Kansas.
In order to determine preliminarily whether the concealed factor of age
has contributed to this strange outcome, we would have to prepare a table
in which men are excluded and the age factor is identified. We would
therefore: (1) stratify the respective populations by sex and age to
determine the proportion of women of child-bearing age; (2) subclassify these
women by age in appropriately small intervals; and (3) compute the
age-specific birth rates for these intervals. Stratification reveals that the
proportions of child-bearing women in California and Kansas are 28.8 per cent
and 24.1 per cent respectively. The age-specific birth rates of these groups
turn out to be as shown in Table 7.2.1. With one insignificant exception
(age group 15-19), the age-specific birth rates of rural women are in fact
higher than those of metropolitan centers, thereby sustaining the hypothesis
which the subclassification was undertaken to test. It was these differences
which were concealed in the crude rates, and which this procedure has
exposed. These lower age-specific birth rates in California are, however,
compensated for by an appreciably larger proportion of women of
child-bearing ages, which then explains the equality of the crude rates between
urban California and rural Kansas. The women of California are less
fertile but there are many more of them.


Table 7.2.1   Age-Specific Birth Rates, Rural Kansas and California Cities of
              100,000 and Over, 1940

                 Age-Specific Birth Rate
Age Group        Urban California     Rural Kansas

15-19                  42.6               32.0
20-24                 120.2              127.1
25-29                  99.8              126.0
30-34                  6?.6               84.8
35-39                  24.2               52.3
40-44                   5.7               20.1
45-49                    .5                2.0

By permission from Table 8-4, p. 150, Population Problems, 4th ed., by
Warren S. Thompson. Copyright 1953. McGraw-Hill Book Company, Inc.
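
The compensation just described can be checked with a rough calculation. The sketch below rests on two assumptions that the text does not state outright: that the crude rate of 14.8 is expressed per 1,000 of total population, and that the quoted proportions (28.8 and 24.1 per cent) are shares of the total population. Under those assumptions, dividing the crude rate by the share gives births per 1,000 women of child-bearing age. The function name is illustrative only.

```python
def births_per_1000_childbearing_women(crude_rate_per_1000, share_childbearing):
    """Convert a crude birth rate into a rate per 1,000 women of child-bearing age.

    Assumes every birth is to a woman in the child-bearing ages and that
    `share_childbearing` is that group's share of the total population.
    """
    return crude_rate_per_1000 / share_childbearing


# Identical crude rates (14.8), different shares of child-bearing women.
california = births_per_1000_childbearing_women(14.8, 0.288)
kansas = births_per_1000_childbearing_women(14.8, 0.241)

print(f"Urban California: {california:.1f} births per 1,000 women of child-bearing age")
print(f"Rural Kansas:     {kansas:.1f} births per 1,000 women of child-bearing age")
```

On these assumptions the same crude rate corresponds to a visibly higher rate among Kansas women, which is the direction the age-specific rates in Table 7.2.1 indicate.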





Norming by subclassification is frequently referred to as holding factors
constant, and represents a familiar approach, colloquially expressed as “all
other things being equal.” Extraneous factors must be held constant in
order to eliminate their disturbing effects. A death rate, for example, is
usually interpreted as a measure of the vital health of a community.
However, if a community with a relatively high death rate happens to have
an older population than another community, that rate cannot be
interpreted as a measure of vital health, but must be ascribed to the historical
accident of an older population. Such a loading is quite irrelevant to the
health conditions we wish to measure. The low death rate in the United
States has not been altogether due to superior national health, but partly
to the fact that the American population consists of such a large
proportion of young persons who are not in the most fatal period of life.

Similarly, a comparison today between the crime rates of the native-born
and the foreign-born populations in the United States (Table 7.2.2a) is
disturbed by the fact that crime, like death and marriage, is age-linked.
The foreign-born are concentrated in the older groups where crime is less
prevalent, whereas the native-born are concentrated in the younger groups
where crime is more prevalent. In 1950, the median age of the foreign-born
was roughly 56, the native-born, only 31 years. If the crude rate is taken
as a measure of propensity to crime, then the age distribution of the
native-born, which is heavily loaded with younger persons, should not be permitted
to work against them. Somehow, the differential effect of the irrelevant
factor of age will have to be removed in order to permit a valid comparison
between the two populations. The somewhat lengthy procedure by which
such a removal is effected is set forth in the next set of tables.


Table 7.2.2a   Crude Crime Rates, Native-Born and Foreign-Born Populations,
               Ages 15-75

Nativity          Population     Crimes     Crime Rate per 1,000

Native-born         30,000         125             4.2
Foreign-born        20,000          72             3.6

Source: Hypothetical


Table 7.2.2a presents the native-born as having a higher crude crime
rate (4.2) than the foreign-born (3.6). It is a plausible hypothesis,
however, that the higher rate for native-born is a result of their younger age
rather than a greater predisposition to crime. In order to investigate this
hypothesis, it is necessary to make comparisons for each age group
separately, which in turn requires that we subclassify offenders by age; that
is to say, we norm by subclassification. This is done in Table 7.2.2b.


Table 7.2.2b   Age-Specific Crime Rates, Native-Born and Foreign-Born Population





We have now put ourselves in a position to answer the query: “Is the
conclusion implied by the crude difference between rates confirmed within
the more detailed categories?” We should recall that the native-born
have a higher crude crime rate than the total foreign-born by .6 of a
permillage point (4.2 - 3.6), or 16.7 per cent (.6/3.6 × 100). But when
comparisons are drawn between matched age categories, that is, when we norm
on age, we find the relative positions of the native-born and the
foreign-born reversed. Thus, our hypothesis is sustained. The native-born now
show equal or lower rates in each age group; significantly, they have
lower rates by 20 per cent and 25 per cent for the age groups in which
crime rates are highest. Yet their over-all crude rate is higher. This
seeming inconsistency is again explained by the differential age loadings:
27 per cent of the native-born, but only 10 per cent of the foreign-born, are
under 25 years of age, the crime-bearing age. We could similarly have
subclassified by sex, nationality, religion, race, or any other variable which
showed promise of explaining plausibly the results in hand, always
assuming, of course, that the additional data are available for such
retabulations.
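
The arithmetic behind this comparison can be sketched in a few lines of Python. The foreign-born populations by age and the native-born age-specific rates are those printed in Table 7.2.2c. The native-born age breakdown and the foreign-born age-specific rates are not legible in this copy; the values used here are an inference, chosen so that they reproduce the totals of Table 7.2.2a (125 and 72 crimes), the 20 and 25 per cent differences, and the 27 and 10 per cent shares under age 25 quoted above. They should be read as an assumption, not as the authors' table.

```python
AGES = ["15-24", "25-44", "45-74"]

# Populations by age. Foreign-born figures follow Table 7.2.2c;
# native-born figures are inferred (assumption), not read from the source.
native_pop = {"15-24": 8_000, "25-44": 17_000, "45-74": 5_000}
foreign_pop = {"15-24": 2_000, "25-44": 8_000, "45-74": 10_000}

# Age-specific crime rates per 1,000. Native-born rates follow Table 7.2.2c;
# foreign-born rates are inferred (assumption).
native_rate = {"15-24": 8, "25-44": 3, "45-74": 2}
foreign_rate = {"15-24": 10, "25-44": 4, "45-74": 2}


def crude_rate(pop, rate):
    """Crude rate per 1,000: total crimes implied by the specific rates over total population."""
    crimes = sum(pop[a] * rate[a] / 1000 for a in AGES)
    return 1000 * crimes / sum(pop.values())


native_crude = crude_rate(native_pop, native_rate)
foreign_crude = crude_rate(foreign_pop, foreign_rate)
print(f"crude rates: native-born {native_crude:.1f}, foreign-born {foreign_crude:.1f}")

# Within each age group the native-born rate is equal or lower.
for a in AGES:
    gap = 100 * (foreign_rate[a] - native_rate[a]) / foreign_rate[a]
    print(f"{a}: native {native_rate[a]}, foreign {foreign_rate[a]}, native lower by {gap:.0f}%")

# Share of each population under 25, which drives the reversal.
native_young = 100 * native_pop["15-24"] / sum(native_pop.values())
foreign_young = 100 * foreign_pop["15-24"] / sum(foreign_pop.values())
print(f"under 25: native-born {native_young:.0f}%, foreign-born {foreign_young:.0f}%")
```

The higher crude rate for the native-born and their equal-or-lower rate in every age group both come out of the same figures; only the age weights differ.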

Norming by Standardization. It is evident from all this that one cannot
generalize to the total groups from the comparison of matched subgroups.
Not only are individual comparisons of unequal weight, but in addition
they usually display considerable variation: observe, for example, the
variation among the differences between the age-specific birth rates of rural
and urban women (Table 7.2.1). A method must therefore be devised to
obtain a simple, quotable, over-all refined rate, which conforms to the
information yielded by the truer, specific rates, but which is free of the
varied weights of the subgroups. This is the method of standardization,
and the rate obtained will be the standardized rate.

The student has already inferred that the higher crime rate of the
native-born must be attributed to the higher proportion of young
persons in that population. It is this excessive loading of young people,
rather than an excessive tendency to commit crime, that explains the
higher crime rate of the native-born. This analysis of the age-specific
crime rates already answers our question about the difference in rates,
with age controlled.

But we do not yet have single rates for each of the two groups as
convenient and terse as were the now discredited crude rates. A valid
comparison between the single rates can be set up, free of the concealed effect
of age (which is the devil in the works), if we could determine the
number of crimes the native-born would have committed if they had
had the same age distribution as the foreign-born. In other words, we
must proceed as if the two populations had the same age distribution,
in order to equalize their exposure to crime. In short, we standardize
on the age factor. The statistical procedure consists in joining the
age-specific crime rates of the native-born to the age distribution of the
foreign-born, as shown in Table 7.2.2c. For example, where the native-born
age-specific crime rate is 8 per thousand, and the foreign-born population
in that age group is 2,000, the hypothetical number of native-born crimes
is 16. The sum of all such hypothetical frequencies is 60.


Table 7.2.2c   Standardization: Native-Born Crime Rate Standardized on
               Foreign-Born Age Distribution

Age          Foreign-Born     Native-Born Crime     Crimes
             Population       Rate per 1,000        Expected

15-24            2,000               8                 16
25-44            8,000               3                 24
45-74           10,000               2                 20

Total           20,000               3                 60

Standardized Native-Born Rate = (60 / 20,000) × 1,000 = 3


By reducing the absolute number (60) of crimes to a rate of 3 per 1,000,
the native-born crime rate is standardized on the age composition of the
foreign-born population. In effect, we have given to the native-born the
“advantage” of the age distribution of the foreign-born. This eliminates
the effect of the divergent age loadings, and thereby holds the factor
of age constant.

By this process, the crime rates have been reversed. The standardized
rate of the native-born is now only 3 per 1,000, or .6 of a permillage point
below that of the foreign-born. The same general conclusion, a higher
rate for the foreign-born, would have been reached had we standardized
in reverse; the crime rate of the foreign-born, standardized on the
native-born age distribution, would have advanced from 3.6 to 5.3, an excess
of 1.1 over the native-born crude rate (4.2).
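
The standardization of Table 7.2.2c can be reproduced as a short Python sketch. The figures are those of the table; the function name and its `per` parameter are illustrative choices, not part of the original procedure.

```python
def standardized_rate(specific_rates, standard_pop, per=1000):
    """Apply one group's specific rates to a standard population.

    Returns (rate per `per`, expected number of events).
    """
    expected = sum(standard_pop[k] * specific_rates[k] / per for k in standard_pop)
    return per * expected / sum(standard_pop.values()), expected


# Native-born age-specific crime rates per 1,000 (Table 7.2.2c).
native_rates = {"15-24": 8, "25-44": 3, "45-74": 2}
# Foreign-born population by age, used as the standard (Table 7.2.2c).
foreign_pop = {"15-24": 2_000, "25-44": 8_000, "45-74": 10_000}

rate, expected_crimes = standardized_rate(native_rates, foreign_pop)
print(f"expected crimes: {expected_crimes:.0f}")               # 16 + 24 + 20
print(f"standardized native-born rate: {rate:.1f} per 1,000")

# Compare with the foreign-born crude rate of 3.6 from Table 7.2.2a.
print(f"gap below foreign-born crude rate: {3.6 - rate:.1f}")
```

Swapping the roles of the two groups (foreign-born rates on the native-born age distribution) uses the same function with the arguments exchanged, given the corresponding age-specific figures.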

While standardization gives every appearance of being a neat and
impersonal routine, there is no formula which prescribes how fine the
subclassification should be, or for that matter what subclassification
to employ. It therefore does not release the social analyst from critical
subject-matter decisions. Thus, age could have been subclassified into
more than three intervals. In fact, it could have been subdivided almost
indefinitely, and the greater the number of subdivisions, the more
precise the comparison. However, there are practical limits beyond which
subclassification need not be extended. In the illustration just given,
the very coarse division of age into three intervals was enough to
establish the importance of age as a factor in the crime rate, which was the
limited objective.

Outcome of Standardization. In the preceding example, the standing
of the two populations was reversed when age was standardized; the
native-born, showing initially a higher rate, emerged after
standardization with a lower rate. But such an extreme reversal is perhaps
exceptional. In the following example, which compares the proportion of
American men 15 years of age and over who were classified as married
in 1890 and 1950, the original difference is reduced when age is controlled,
but not eliminated or reversed.

In 1890, only 54.1 per cent of American men were classified as married
(Table 7.2.3a), whereas in 1950 the percentage had risen to 68.9 (Table
7.2.3b), an increase of 14.8 percentage points. We ask: Does this
appreciable increase in the proportion married demonstrate an increasing
propensity among American men to marry? Or, could it be due merely
to a higher average age more favorable to marriage? Such a shift to a
higher average age could induce a rise in the per cent married, with the
basic propensity to marriage remaining unchanged.

To determine whether the per cent married would have increased from 
54.1 to 68.9 had the age distribution remained unchanged, we need only 
hold age constant by applying the age-specific married rates in 1950 to
the age distribution of males in 1890 (Table 7.2.3c). The resulting
standardized rate for 1950 is 62.9 per cent, which is 6.0 percentage points
lower than the observed crude rate of 1950, but yet 8.8 percentage points
higher than the crude rate of 1890. From this computation, we may
infer that 41 per cent of the increase (6.0/14.8 × 100) between 1890 and
1950 was due to a change in the age distribution, and 59 per cent
(8.8/14.8 × 100) was due to other, unspecified factors.


Table 7.2.3a   Percentage of Males Married in Specified Age Groups, U.S., 1890

                      Males                      Males Married*
Age              Number       Per Cent        Number       Per Cent

15-19           3,248,711      15.71            16,746        0.5
20-24           3,104,893      15.02           585,748       18.9
25-29           2,698,311      13.05         1,421,407       52.7
30-34           2,425,664      11.73         1,728,930       71.3
35-44           3,705,648      17.92         2,997,030       80.9
45-54           2,627,024      12.71         2,213,901       84.3
55-64           1,630,373       7.89         1,342,414       82.3
65 and over     1,233,719       5.97           869,925       70.5

Total          20,674,343     100.00        11,176,101       54.1

* Excludes widowed and divorced.
Source: Bureau of the Census, U.S. Census of the Population: 1950, Vol. II,
Characteristics of the Population, Part 1, United States Summary, Table 103,
U.S. Government Printing Office, Washington, D.C., 1953.


Table 7.2.3b   Percentage of Males Married in Specified Age Groups, U.S., 1950

                      Males                      Males Married
Age              Number       Per Cent        Number       Per Cent

15-19           5,323,470       9.95           166,935        3.1
20-24           5,559,265      10.39         2,217,810       39.9
25-29           5,904,975      11.04               ...       74.2
30-34           5,562,315        ...         4,690,995       84.3
35-44          10,402,193      19.44         9,046,675       87.0
45-54           8,484,515      15.86               ...       85.7
55-64                 ...        ...         5,320,670       81.4
65 and over           ...        ...         3,767,300       65.7

Total                 ...     100.00        36,859,395       68.9

Source: See Table 7.2.3a.


Table 7.2.3c   Computation of Standardized Rate: Percentage of Males Married
               in 1950 Standardized on 1890 Age Distribution

Age              Number of        Per Cent          Expected Number
                 Males, 1890      Married, 1950     of Marriages

15-19             3,248,711           3.1                100,710
20-24             3,104,893          39.9              1,238,852
25-29             2,698,311          74.2              2,002,147
30-34             2,425,664          84.3              2,044,835
35-44             3,705,648          87.0              3,223,914
45-54             2,627,024          85.7              2,251,360
55-64             1,630,373          81.4              1,327,124
65 and over       1,233,719          65.7                810,553

Total            20,674,343                           12,999,495

1950 Standardized Marriage Rate = (12,999,495 / 20,674,343) × 100 = 62.9%
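
The computation of Table 7.2.3c, and the division of the 14.8-point increase into an age component and a remainder, can be sketched as follows. The inputs are the 1890 counts and 1950 percentages as read from the tables; the variable names are illustrative. The split into two components simply follows the text's reasoning and is not a general decomposition method.

```python
# Males by age in 1890 and per cent married in 1950 (Table 7.2.3c).
males_1890 = {
    "15-19": 3_248_711, "20-24": 3_104_893, "25-29": 2_698_311, "30-34": 2_425_664,
    "35-44": 3_705_648, "45-54": 2_627_024, "55-64": 1_630_373, "65+": 1_233_719,
}
pct_married_1950 = {
    "15-19": 3.1, "20-24": 39.9, "25-29": 74.2, "30-34": 84.3,
    "35-44": 87.0, "45-54": 85.7, "55-64": 81.4, "65+": 65.7,
}
CRUDE_1890, CRUDE_1950 = 54.1, 68.9  # observed per cent married

# Expected marriages if 1950 rates had applied to the 1890 age distribution.
expected_married = sum(
    males_1890[age] * pct_married_1950[age] / 100 for age in males_1890
)
standardized_1950 = 100 * expected_married / sum(males_1890.values())

total_change = CRUDE_1950 - CRUDE_1890          # 14.8 points
due_to_age = CRUDE_1950 - standardized_1950     # removed by standardizing
due_to_other = standardized_1950 - CRUDE_1890   # survives standardizing

print(f"standardized 1950 rate: {standardized_1950:.1f}%")
print(f"share due to age distribution: {100 * due_to_age / total_change:.0f}%")
print(f"share due to other factors:    {100 * due_to_other / total_change:.0f}%")
```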

Standardization of Means. Standardization is a remarkably versatile
tool not limited to simple rates and percentages. Any kind of
arithmetic average may be standardized, provided, of course, the requisite
data on subgroups are available. For example, let us suppose that
sorority women at some college have a grade-point average of 1.98, while
non-sorority women have a mean grade-point of 1.88 (Table 7.2.4a).
From these averages alone, it would appear that the sorority women are
scholastically superior. However, the non-sorority women might object
to such an interpretation on the ground that sororities contain a larger
proportion of upperclassmen, who normally receive higher grades than
freshmen and sophomores. The superior grades of the sorority group
may be attributable in part to the concealed factor of class composition
rather than merely academic ability. If this hypothesis is plausible,
why not standardize non-sorority grades on the class composition of
sorority women as a test?

Table 7.2.4a   Grade-Point Averages by Class, Non-sorority and Sorority Women

                     Non-sorority                    Sorority
Class           f (%)    Mean    f × Mean       f (%)    Mean    f × Mean

Freshman          40     1.8*       72            20     1.8        36
Sophomore         30     1.8        54            35     2.0        70
Junior            20     2.0        40            25     2.0        50
Senior            10     2.2        22            20     2.1        42

Total            100     1.88      188           100     1.98      198

* Based on a grading system in which grade-points per credit hour are as
follows: A = 3, B = 2, C = 1, D = 0, F = -1.

Source: Hypothetical


Pursuing this suggestion, we reweight the class-specific averages of
non-sorority women by the class percentages of sorority women (Table
7.2.4b). In this reweighting, the grade average of non-sorority
freshmen is multiplied by the percentage of sorority women classified as
freshmen (1.8 × 20 = 36). In identical manner, we calculate the remaining
products. The sum of such products, divided by 100, yields the
sought-for standardized mean: the grade average of non-sorority women
standardized on the class composition of sorority women.


Table 7.2.4b   Computation of Standardized Mean: Non-sorority Average
               Standardized on Sorority Distribution by Class

Class           Sorority     Non-sorority     f × Mean
                f (%)        Mean

Freshman          20%           1.8              36
Sophomore         35            1.8              63
Junior            25            2.0              50
Senior            20            2.2              44

Total            100%           1.93            193

Standardized Grade-Point Average of Non-sorority Women = 193 / 100 = 1.93
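
The reweighting in Table 7.2.4b amounts to a weighted mean computed twice, once with the group's own class percentages and once with the standard's. A minimal Python sketch, using the figures of Tables 7.2.4a and 7.2.4b (the function name is illustrative):

```python
# Class percentages and class-specific grade averages (Tables 7.2.4a, 7.2.4b).
non_sorority_pct = {"Freshman": 40, "Sophomore": 30, "Junior": 20, "Senior": 10}
sorority_pct = {"Freshman": 20, "Sophomore": 35, "Junior": 25, "Senior": 20}
non_sorority_mean = {"Freshman": 1.8, "Sophomore": 1.8, "Junior": 2.0, "Senior": 2.2}


def weighted_mean(means, weights):
    """Mean of the class-specific averages, weighted by the given class percentages."""
    return sum(means[c] * weights[c] for c in weights) / sum(weights.values())


# Crude mean: non-sorority averages with non-sorority class composition.
crude = weighted_mean(non_sorority_mean, non_sorority_pct)
# Standardized mean: same averages, sorority class composition.
standardized = weighted_mean(non_sorority_mean, sorority_pct)

print(f"crude non-sorority mean:        {crude:.2f}")
print(f"standardized non-sorority mean: {standardized:.2f}")
```

Only the weights change between the two calls; the class-specific averages of the non-sorority women are left untouched.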


The crude mean of the non-sorority group (1.88), when standardized
(1.93), is still somewhat below that of the sorority group (1.98). Thus,
standardization has narrowed but not wiped out the original difference
between the two groups. Even when the non-sorority women are
accorded the benefit of sorority class composition, they still do not quite
attain the level of sorority houses. Evidently other factors besides class
composition have affected the grade averages, such as course of study,
IQ differences, and the like. These traits, as well as others, could be
employed for an indefinite refinement of the data, but at the same time
they suggest the impracticality of carrying standardization very far.
The infinite potentialities of standardization remind us once again how
remote is “ultimate truth” from the data which lie before us and on
which belief and action must nevertheless be based.

The Standard Million. Quite frequently, the crude rates that are to
be converted into standardized rates are scattered over a large territory
and cover an extended duration of time. Interstate vital statistics and
international statistics involve cumbersome comparisons unless there is
a generally recognized and available standard. To satisfy that need on
an international basis, the Standard Million of England, 1901, was often
employed by common consent at the beginning of this century. Since
the age distribution of a population is one of the most distorting factors
in the interpretation of vital statistics, this Standard Million was
composed of the age distribution of the British population given in terms of
a million population.

The U.S. National Office of Vital Statistics at present employs, for
the computation of its vital statistics, a Standard Million based on the
population census of April 1, 1940 (Table 7.2.5). Such a procedure
standardizes the vital rates for age, thereby rendering them comparable.


Table 7.2.5   Standard Million as Determined from the Population of the
              United States, Enumerated as of April 1, 1940

Age                      Standard Million

Under 1 year                   15,343
1-4 years                      64,718
5-14 years                    170,355
15-24 years                   181,677
25-34 years                   162,066
35-44 years                   139,237
45-54 years                   117,811
55-64 years                    80,294
65-74 years                    48,426
75-84 years                    17,303
85 years and over               2,770

All Ages                    1,000,000

Source: U.S. Department of Health, Education, and Welfare, National Office of
Vital Statistics, Vital Statistics: Special Reports, Vol. 49, No. 94, May 1949.
Government Printing Office, Washington.
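
A Standard Million is used as a fixed set of weights. The sketch below applies the 1940 Standard Million of Table 7.2.5 to a set of age-specific rates. The two sets of rates are hypothetical, invented purely to show the mechanics, and the function name is likewise illustrative.

```python
# Standard Million based on the U.S. population of April 1, 1940 (Table 7.2.5).
STANDARD_MILLION = {
    "Under 1": 15_343, "1-4": 64_718, "5-14": 170_355, "15-24": 181_677,
    "25-34": 162_066, "35-44": 139_237, "45-54": 117_811, "55-64": 80_294,
    "65-74": 48_426, "75-84": 17_303, "85 and over": 2_770,
}


def age_adjusted_rate(age_specific_rates):
    """Weight each age-specific rate (per 1,000) by its share of the Standard Million."""
    total = sum(STANDARD_MILLION.values())
    weighted = sum(STANDARD_MILLION[age] * age_specific_rates[age] for age in STANDARD_MILLION)
    return weighted / total


# Hypothetical rates per 1,000, for illustration only.
flat = {age: 10.0 for age in STANDARD_MILLION}
old_age_heavy = dict(flat, **{"85 and over": 200.0})

print(f"weights sum to {sum(STANDARD_MILLION.values()):,}")
print(f"flat rates:          {age_adjusted_rate(flat):.2f} per 1,000")
print(f"old-age-heavy rates: {age_adjusted_rate(old_age_heavy):.2f} per 1,000")
```

Because the oldest group carries only 2,770 of the million, even a very high rate in that group moves the adjusted figure only slightly.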




Norming of Cross-Tabulations by Subclassification. Cross-tabulations
are usually set up in order to exhibit statistical association, but the face
value of that association cannot be perfunctorily accepted, again
because of the almost certain presence of concealed factors. In order to
expose such concealed factors, cross-tabulations may also be
subclassified. To illustrate such norming, we manipulate data (fictitious) on
52 community areas, cross-classified by race and by degree of
delinquency (Table 7.2.6a).


Table 7.2.6a   Delinquency Rates by Race, 52 Local Communities

                           Delinquency Rate
                  Number                      Per Cent
Race        High    Low    Total        High    Low    Total

Negro        15       8      23          65      35     100
White        10      19      29          34      66     100

Total        25      27      52          48      52     100

Source: Hypothetical


The tabulation indicates that Negro areas are more often
characterized by high delinquency rates than are the white areas, thus suggesting
a statistical association between race and delinquency. This association
is brought out more clearly by the percentage distribution which reveals
that 65 per cent of all Negro areas are in the “high delinquency”
category, while only 34 per cent of the white areas are so classified. In view
of this crude difference between Negro and white neighborhoods, the
conclusion of an association between race and delinquency seems
justified.

However, such a bald conclusion, without reservations, would be
unacceptable to anyone experienced in the social pathology of the city.
He would point out that Negroes are concentrated in the blighted areas,
where the standard of living is low, whereas whites more often reside in
the more favorably situated districts that enjoy a relatively high living
standard. It is reasonable therefore to ask whether the difference
between Negro and white areas, as regards delinquency, would hold up if
Negro and white areas were normed on the same economic level.
Implementing this line of reasoning, we would subclassify areas according
to standard of living, and draw comparisons within homogeneous
socio-economic subclasses (Table 7.2.6b).


Table 7.2.6b   Delinquency Rates by Race and Economic Level

                              Economic Level
                  High Standard               Low Standard
                  Delinquency Rate            Delinquency Rate
Race        High    Low    Total        High    Low    Total

Negro         1       6       7          14       2      16
White         3      18      21           7       1       8

Total         4      24      28          21       3      24

                          Percentage Distribution

Negro        14      86     100          88      12     100
White        14      86     100          88      12     100

Total        14      86     100          88      12     100


Inspecting the two partial tables which display the partial
associations, we find that in both tables the association between race and
delinquency has vanished: of the 24 economically inferior areas, 21, or 88
per cent, are in the category of high delinquency; and this is true whether
the areas are Negro or white. Of the 28 areas designated as
economically superior, only 4, or 14 per cent, are in the high delinquency category;
and this likewise holds regardless of race of area.

In this contrived illustration, then, the incidence of delinquency is
dependent altogether on economic level, and not at all on the race
variable. The analysis has therefore demonstrated that the original
difference between Negro and white areas was due to the disproportionate
concentration of Negroes in the economically depressed areas rather
than the factor of race as such. Such an association is called spurious,
since it is actually the effect of the concealed socio-economic factor which
provides the “genuine” explanation. This distinction between spurious
and genuine association is an important one, and will be discussed more
extensively in Chapter 10.
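
The disappearance of the association within economic levels can be verified directly from the cell counts of Table 7.2.6b. The following Python sketch pools the counts to recover the crude percentages of Table 7.2.6a and then computes the partial percentages; the dictionary layout and function name are illustrative choices.

```python
# Areas by (race, economic level): (high-delinquency, low-delinquency), Table 7.2.6b.
AREAS = {
    ("Negro", "high"): (1, 6),
    ("White", "high"): (3, 18),
    ("Negro", "low"): (14, 2),
    ("White", "low"): (7, 1),
}


def pct_high_delinquency(cells):
    """Per cent of areas in the high-delinquency category for the given cells."""
    high = sum(h for h, _ in cells)
    total = sum(h + l for h, l in cells)
    return 100 * high / total


# Crude association: pool the economic levels within each race (Table 7.2.6a).
crude = {
    race: pct_high_delinquency([v for (r, _), v in AREAS.items() if r == race])
    for race in ("Negro", "White")
}

# Partial association: compare the races within each economic level.
partial = {
    (race, level): pct_high_delinquency([AREAS[(race, level)]])
    for level in ("high", "low")
    for race in ("Negro", "White")
}

for race, pct in crude.items():
    print(f"crude   {race}: {pct:.0f}% high delinquency")
for (race, level), pct in partial.items():
    print(f"partial {race}, {level} economic level: {pct:.1f}% high delinquency")
```

The crude percentages differ sharply by race, while within each economic level the two races show the same percentage.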

The uneven loading of Negroes and whites in the two broad economic
strata is further clarified by Table 7.2.6c, which rearranges the partial
tables. Lifting an example, we learn that 15 out of 25 high delinquency
areas are Negro, and of these 15, 14 are on a low economic level.

Table 7.2.6c   Delinquency Rates by Race and Economic Level

                             Delinquency Rate
                  High                          Low
                  Economic Level                Economic Level
Race        High    Low    Total        High    Low    Total

Negro         1      14      15           6       2       8
White         3       7      10          18       1      19

Total         4      21      25          24       3      27


The Function of Subclassification. Basically, subclassification is a
procedure for refining comparisons. It is a device for holding a variable
constant, a basic element in all rigorous scientific procedure.
Statistics are seldom what they seem on face value, and it is usually
necessary to go beneath the surface in order to expose their full meaning.
Subclassification, like anatomical dissection, is one probing tool for more
thorough and penetrating statistical analysis. There are two
distinctive interpretations which may be applied to the process of
subclassification for the purpose of holding a factor constant. In one instance, the
concealed factor may be viewed as an extraneous and disturbing
influence which should be “removed” and thus rendered ineffectual; in
other instances, the concealed factor may be viewed as having causal
significance which will open up a new line of explanation. In a strictly
technical statistical sense, these two procedures are identical, but they
have different sociological implications. Thus, in comparing the crime
rates of the foreign-born and native-born, we subclassified by age in
order to eliminate that adventitious factor by holding it constant; whereas
we subclassified urban neighborhoods by economic level in order to
determine whether the association between race and delinquency was a
spurious one, attributable on deeper probing to the socio-economic factor.
But whether subclassification is undertaken to eliminate an extraneous
variable or to isolate a “causative” factor, the mechanics of the
procedure are identical. Conversely, the conceptualization of the results
is independent of the procedure, which simply consists in subclassifying
according to the factor assumed to be related to the variable under study.

The employment of subclassification must of course be anticipated
at the outset in the collection of the data. If, for example, it is desirable
to subclassify by age, as in comparisons of birth and death rates,
obviously information on age must have been previously collected. If the
subclassification of urban areas by economic level is contemplated,
information on the economic characteristics of each area, as well as an ample
supply of data, must be provided for. Such breakdowns cannot be
undertaken as an afterthought. This further points up the necessity of a
research design to guide the collection of the data, without which the
assembled information cannot be effectively exploited.

Subclassification and Standardization Compared. Subclassification differs
from standardization in that it is purely descriptive and yields as many
measures as there are subclasses. A standardized rate, on the other
hand, is hypothetical rather than descriptive; it is a single composite
index that is weighted by a set of subclass frequencies which are used
as a standard. It hypothesizes what the measure would be if the selected
subclass frequencies did in fact obtain. Thus, we calculated the
percentage of men married in 1950 as if they had had the same age distribution
as males in 1890.
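
The contrast can be put in symbols. The notation below is not the authors'; it is introduced here only as a compact restatement. Let $r_i$ be the specific rate of the group under study in subclass $i$, let $n_i$ be that group's own subclass frequencies, and let $N_i$ be the subclass frequencies of the population chosen as standard.

```latex
\[
R_{\text{crude}} = \frac{\sum_i n_i \, r_i}{\sum_i n_i},
\qquad
R_{\text{std}} = \frac{\sum_i N_i \, r_i}{\sum_i N_i}.
\]
```

Subclassification reports each $r_i$ separately; standardization collapses them into the single figure $R_{\text{std}}$. The two rates coincide whenever the $n_i$ are proportional to the $N_i$, that is, whenever the group already has the standard's distribution.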

Since the standardized rate is a hypothetical rather than an observed
measure, a certain amount of caution is always necessary in its
interpretation. It must be remembered that standardization is basically a
process of reweighting, the logic of which must always be appraised in
relation to the standardized results. But all normed measures are derivative
rather than raw measures. To interpret them will always require a
reserve of statistical and subject-matter sophistication, the more so as the
chain of derivations increases in length and complexity.


Questions and Problems 

1. Define the following concepts:

Subclassification
Standardization
Concealed Factor
Spurious Factor
Causal Factor
Partial Association
Partial Table

2. In 1900, nearly 100,000 children graduated from United States high schools;
in 1950, over 1,000,000 graduated from high schools. Criticize the statement:
“High school graduates increased tenfold in 50 years.”

3. For some years the crime rate for foreign-born has been declining as compared
to the rate for native-born. In view of your familiarity with immigration history,
how would you, in part, explain this decline? What base would you use to
calculate the rates in order to produce a tenable impression?

4. Do standardized rates necessarily differ from crude rates?

5. Explain why the effect of standardization varies according to the choice of
standard population.



6. Convert the cell frequencies of Table 7.2.6a to percentages of column totals. 
Is the association between the two variables thereby modified? 

7. Standardize the 1890 marriage rate of males on the 1950 age distribution 
(Tables 7.2.3a and 7.2.3b). 

8. Using Table 7.2.2b, standardize the foreign-born crime rate on the native-born
age distribution.

9. Standardize the Kansas death rate on the Michigan age distribution (Table
7.2.7). Interpret.


Table 7.2.7   Age-Specific Death Rates, Kansas and Michigan, 1950

                    Kansas                          Michigan
Age         Population    Age-Specific      Population    Age-Specific
                          Death Rate                      Death Rate

...            199,882         6.7             703,861        ...
...            299,267          .6           1,040,657        ...
...            277,504         1.3             922,273        ...
...            282,155         1.6           1,042,819        ...
...            253,327         4.6             897,921        ...
...            216,697         6.4             736,124        ...
...            182,249        14.6             566,461        ...
...            127,010        35.4             319,653        ...
...             56,433        91.0             120,310        ...
...             10,770       214.2              21,687        ...

All ages     1,905,299        10.0           6,371,766        ...

Sources: U.S. National Office of Vital Statistics, Vital Statistics of the U.S.,
1950, Vol. III, Table 47, U.S. Government Printing Office, Washington, D.C.,
1953; U.S. Bureau of the Census, Statistical Abstract of the U.S., 1950, U.S.
Government Printing Office, Washington, D.C., 1951.


Selected References 

Bureau of the Census, Handbook of Statistical Methods for Demographers.
U.S. Government Printing Office, Washington, D.C., 1951. Chapter 3.

Merton, Robert K., and Paul F. Lazarsfeld, Continuities in Social
Research. The Free Press, Glencoe, Illinois, 1950. Pages 133-167.

Silberman, Leo, “Essential Statistical Concepts and Methods in
Demography,” an appendix to Population Problems, 2d edition, by Paul H.
Landis and Paul K. Hatt. American Book Company, New York, 1954.

Zeisel, Hans, Say It with Figures, 4th edition. Harper & Brothers, New
York, 1957. Chapters 8 and 9.







Probability 


Section One 

The Nature of Probability

The Probabilistic View of Nature. Society, as a collectivity of individuals,
could not exist without more or less uniform patterns of social behavior
which assure a certain fulfillment of our mutual expectations. Such
assurance is equally necessary in our relation to the behavior of physical
nature.

On the other hand, experience has also taught us that the fulfillment of
our anticipations is accompanied by considerable uncertainty. It is this
uncertainty which gives rise to statements of probability. We may
estimate the probability of war, of an economic recession, of the
opportunities for a job with or without an academic degree. We say that rain is
very probable, and carry an umbrella (or not) according to our confidence
in such a forecast; we may gamble on a head or a tail; we speculate on the
sex of a new baby. Sociologists attempt to measure the probability of job
success, of a juvenile delinquent’s becoming an adult criminal, or of a
marriage terminating in divorce. Thus, the layman and the actuary, each
in his own way, estimate the probability of death before 70, of a wife’s
survival, and even the probability of twins. On such estimates, man makes
numerous decisions which guide his action (he gets married or gives up
smoking) and even constructs a philosophy of life (he becomes a puritan
or an agnostic).

The forecast of any outcome must necessarily be based on past observa- 
tions. All statements of probability are a leap into the future from the 
springboard of past experience. Only a prophet is emancipated from the 
continuity of natural events. To the ordinary mortal is not given 
the power of prophecy; he can only extrapolate from the past. 



On the prospects of war, adequate information is difficult to come by,
but the prospects of death before the age of 70 are well codified in our
actuarial tables.

From all this, it is apparent that probability is not an absolute
condition of nature, but rather an organization imposed on nature by the
observer. It is a purely human estimation of the likelihood of an event
in the future, based on acquired knowledge. A statement of probability
will therefore have no immutable value, but rather will vary according
to (a) the information held by the individual who formulates it and
(b) the specific conditions under which the event is perceived. Thus, a
farmer who views the cloud formation, a grandfather who contemplates
the fluctuating pain of rheumatism, a meteorologist with his charts —
all would forecast the weather with varying degrees of accuracy.
Obviously, so far as natural law is concerned, the rain will or will not fall,
independently of any weather man and his elaborate probability tables.
Such events were occurring before probability statistics were ever
invented, and presumably will continue to occur whether the
meteorologist is looking or not. There can be no probability statement
independent of the human observer who formulates it.

Nor can there be a statement of probability wholly free of assumptions
regarding the conditions under which the event occurs. No measure of
probability is unconditional, for the simple reason that every event occurs
under given circumstances which affect the degree of its probability.
The self-same farmer in Kansas will have varying probabilities of death
according to whether the probability is calculated on the total
population of the United States, on the general population of Kansas, or only
on Kansas farmers of his age. It all depends on the composition of the
set or subset in which the farmer is located. But no matter how refined
the measure of probability becomes, it still represents a human
classification of natural events, rather than Nature herself.

Nevertheless, for all their relativity, probability statements cannot be
framed arbitrarily. Their formulation is governed by fixed rules, the
validity of which has been confirmed in countless problems and
applications. But before turning to the simplest of these rules, collectively
known as the “calculus of probabilities,” it will be profitable to examine
in a general way how the predicted event is statistically conceived.

Determining and Chance Factors. We conceive of any event as a juncture
or resultant of innumerable factors, which may be classified into two
broad categories: determining factors and chance factors.

The determining factors are those which have been isolated and to
which we attribute the explanation or causation of the event. Thus,
we have learned that divergent cultural backgrounds of married couples
may predispose toward divorce; broken homes are associated with
specific types of delinquency. The identification of determining factors is
never complete, and may in fact consist only of plausible hypotheses to
be checked. In any case, the focus of science is always on the
determining factors, since they supply the basis for the understanding and
predicting of events.

Chance factors are the remaining or unknown factors affecting an
event. Now, chance is not a simple concept. It represents a miscellany
of factors, and is invoked to the extent that there are no assignable causes.
In short, chance is a term covering our ignorance. Nevertheless, we must
come to some understanding of a concept which has proved itself so
indispensable in man’s effort to explain nature.

For statistical purposes, chance factors are presumed to be (1) very
numerous, (2) relatively minute, (3) independent of one another, and
(4) largely unidentifiable and therefore not measurable; consequently,
(5) in effect, they work collectively to produce equally likely events.
When we are unable to assign a cause, we resort to chance as an
“explanation” and equiprobability as a prediction. Although there is an element
of chance in the production of every event, its characteristics may best
be analyzed by using games of chance as a model — situations in which
chance factors by definition operate exclusively.

Thus, the drop of the ace of spades into the bridge hand of North is
a result of sheer chance. (1) It is the result of innumerable factors, such
as the location of the card in the previous deal, the amount of shuffling,
the position of the dealer, and the like. (2) Each of these factors alone is
relatively impotent and is easily counteracted by other factors. (3) These
factors act independently of one another; the location of the ace in the
undealt pack, the zeal of the shuffler, the cut of the opponent, the
location of the hands, are all unrelated. (4) It is impossible for anyone
to conceive of, or identify, even a small proportion of the factors
influencing the drop of the card, much less to measure their force. (5) Since
the chance factors are assumed to impinge with equal force on all
cards, each card is as likely to turn up as is the ace; this means that
all cards presumably will appear in the long run with equal frequency
in a given hand. A professional gambler, or “magician,” on the other
hand, unwilling to leave the outcome to the laws of chance, will cut the
deck according to his illicit knowledge of the location of the ace.

One must not be misled into believing that the distinction between
chance and determining factors is intrinsic and in the nature of things.
Mother Nature herself would make no such distinction. There is
nothing mystical about chance; chance factors are just as materially real as
are determining factors. In fact, they may be viewed as unidentified
“determining” factors which play an important role in the production
of events. The selection of a spouse, the choice of occupation, the
outcome of a football game, the fatal accident — all may have their turning
point in chance factors. No event is completely accounted for without
a consideration of their potency.

Nevertheless, it may be pointless and prohibitive even to attempt to
analyze them individually, especially when the event is exclusively
determined by chance factors. For example, the probability of a specified
hand in bridge is 1 out of 635,013,559,600. How useful would it be to
expend time and energy on redefining the chance factors in order to
explain how such a rare and relatively unimportant event occurred in the
first place? On the other hand, science has, as its ideal, the
transformation of all chance factors into determining factors. This
would permit the prediction of the specific individual event and remove
the procedure from the province of statistics. But twentieth century
natural science has tempered this Newtonian ideal, because of the
growing awareness that certain types of phenomena are not amenable to
classical analysis. The following excerpt from Einstein and Infeld *
illustrates this methodological revision in the realm of physics. In principle,
it could have been written by a sociologist:

There is a vessel containing gas. In attempting to trace the motion of
every particle one would have to commence by finding the initial states,
that is, the initial positions and velocities of all the particles. Even if this
were possible, it would take more than a human lifetime to set down the
result on paper, owing to the enormous number of particles which would
have to be considered. If one then tried to employ the known methods
of classical mechanics for calculating the final positions of the particles,
the difficulties would be insurmountable. In principle, it is possible to use
the method applied for the motion of planets, but in practice this is useless
and must give way to the method of statistics. ... We become indifferent
to the fate of the individual gas particles. Our problem is of a different
nature. For example: we do not ask, “What is the speed of every
particle at this moment?” But we may ask: “How many particles have a
speed between 1000 and 1100 feet per second?” We care nothing for
individuals. What we seek to determine are average values typifying the whole
aggregation. It is clear that there can be some point in a statistical method
of reasoning only when the system consists of a large number of
individuals.

By applying the statistical method we cannot foretell the behavior of
an individual in a crowd. We can only foretell the chance, the probability,
that it will behave in some particular manner. If our statistical laws tell
us that one-third of the particles have a speed between 1000 and 1100 feet
per second, it means that by repeating our observations for many
particles, we shall really obtain this average, or in other words, that the
probability of finding a particle within this limit is equal to one-third.

Similarly, to know the birth rate of a great community does not mean
knowing whether any particular family is blessed with a child. It means
a knowledge of statistical results in which the contributing personalities
play no role.

By observing the registration plates of a great many cars we can soon
discover that one-third of their numbers are divisible by three. But we
cannot foretell whether the car which will pass in the next moment will
have this property. Statistical laws can be applied only to big
aggregations, but not to their individual members.

* Albert Einstein and Leopold Infeld, The Evolution of Physics, Simon and Schuster,
New York, 1938, pp. 293-299.
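The registration-plate remark in the excerpt can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes, hypothetically, that plate numbers are simply the integers 1 through N, which real numbering schemes need not follow, and the function name is invented for the example.

```python
# Hypothetical illustration: plate numbers are assumed to be the
# integers 1..n_plates; real registration schemes may differ.
def share_divisible_by_three(n_plates: int) -> float:
    """Return the proportion of the integers 1..n_plates divisible by 3."""
    count = sum(1 for number in range(1, n_plates + 1) if number % 3 == 0)
    return count / n_plates


if __name__ == "__main__":
    # The proportion settles near one-third as the aggregate grows,
    # although nothing is said about any single plate.
    for n in (10, 1_000, 100_000):
        print(n, round(share_divisible_by_three(n), 4))
```

The proportion describes the aggregate only; whether the next plate is divisible by three remains unknown until it is observed.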

Whatever might have been stated in the previous discussion about
the uncertainties arising out of the play of chance factors, it does not
argue the entire capriciousness of chance. Although chance factors
cannot be used to predict the individual event, they nevertheless produce
certain regularities in the behavior of masses of events which are termed
the laws of chance. The formulation of these laws has been an
accomplishment of the last two hundred years, and these have come to be the
basic ingredient in what is known as inferential statistics.


The Possibility Set. Statistically speaking, a probability is simply the
proportion of times an event is expected to happen in the long run, in
a great mass of trials. But such proportions cannot be established unless
all possible different outcomes have been identified. Together, these
outcomes that could occur constitute the possibility set. The probabilities
of all possible outcomes of course sum to 1.00, since it is certain that one
outcome of the set will occur on a given trial. Furthermore, alternative
outcomes must be mutually exclusive, and must be at least two in
number, for if there were only one possible outcome, there would be no
uncertainty and therefore no probability. These minimum alternatives are
frequently labeled success and failure. If attention is focused on only
one possible outcome — termed “success” — only two probability values
need be calculated; for example, the probability of an ace and not-ace,
the probability of spade and not-spade.
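The requirements on a possibility set (exhaustive, mutually exclusive, probabilities summing to 1.00) can be illustrated with a deck of cards. This is a minimal sketch under the assumption that all 52 cards are equally likely; the helper name `probability` and the card encoding are invented for the example. It also recomputes the number of distinct 13-card bridge hands quoted earlier in the section.

```python
from fractions import Fraction
from math import comb

RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = ["spades", "hearts", "diamonds", "clubs"]

# The possibility set: every card that could be dealt, each listed once.
deck = [(rank, suit) for rank in RANKS for suit in SUITS]


def probability(event) -> Fraction:
    """Share of equally likely cards for which event(card) is true."""
    favorable = sum(1 for card in deck if event(card))
    return Fraction(favorable, len(deck))


# "Success" and "failure" exhaust the set, so they sum to 1.
p_ace = probability(lambda card: card[0] == "A")
p_not_ace = probability(lambda card: card[0] != "A")

# The same set regrouped by suit: four exclusive, exhaustive outcomes.
suit_probs = {s: probability(lambda card, s=s: card[1] == s) for s in SUITS}

# Possibility set for a whole bridge hand: all 13-card subsets of 52 cards.
n_bridge_hands = comb(52, 13)

if __name__ == "__main__":
    print(p_ace, p_not_ace, p_ace + p_not_ace)
    print(sum(suit_probs.values()))
    print(n_bridge_hands)
```

Regrouping the cards by rank or by suit changes the outcomes that are counted, but in either grouping the probabilities still sum to one.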

But in any case, it is essential to identify all possibilities that belong to
the possibility set, otherwise no probability statement can be made.
Consider the question: “What is the probability that Mr. Jones will vote
Republican?” Before we can answer that question quantitatively we must
first determine all possible ways of voting (i.e., the number of parties on
the ballot). Without that information, no probability statement would
be possible, since the probability of a given outcome can only be measured
in relation to all possible outcomes. We must know the number of all
political parties in order to fix the probability of any single party.
While every logical possibility must be enumerated, they still may be
variously grouped according to the purposes of the user. The given
possibilities may be decomposed or recombined into any desirable subset.
Multiple births may be broken down into twins, triplets, quadruplets, and
the like, while home runs, triples, and doubles may be grouped together
as extra-base hits. Playing cards may be classified according to suit or
face value; deaths may be classified according to terminal cause. But
however outcomes are arranged, the set must include all possible outcomes
— that is, it must be exhaustive.

Once the possibility set has been established, we then determine the
relative frequency, or weight, of each possible outcome or event; for
example, the expected frequency of a head and the expected frequency of a
tail; the expected frequency of a male birth and the expected frequency
of a female birth, the expected frequencies of single and multiple births,
or of Republican and Democratic votes. The establishment of such
expected proportions may be based on one of two principles: the a priori,
or the empirical principle.

A Priori Probabilities. A toss of a coin may result in one of two outcomes:
head or tail. We say the probability of tossing a head is 1/2 or 50 per cent;
the probability of a tail, exactly the same. Similarly, we examine the 52
cards of a well-shuffled bridge deck, and conclude that each card is equally
likely to be dealt. On what evidence do we weight them equally?
According to one theory, the only reply is that we have no reason to conclude
otherwise: we examine the coin for symmetry, and the 52 cards for perfect
uniformity, and the tosser or dealer for honesty, and by a priori reasoning
conclude that the evidence is sufficient to predict that each outcome is
equally likely in the long run. This method, not based on experimental
test or observation, has been labeled the principle of sufficient reason.* The
examination of a die would lead to the analogous conclusion that each side
would be equally likely in the long run. However, an oblong eraser or
pyramid-shaped paper weight would not offer intuitive evidence that each
side would turn up with equal frequency in a large number of throws.
There is no ready means available to ascertain a plausible ratio between
the various sides on the basis of their shapes and areas. To establish the
probabilities of the respective sides, it would be necessary to toss the
objects a very large number of times and record the empirical results.
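The procedure for an irregular object (toss it many times and record the results) can be mimicked by simulation. The sketch below is hypothetical throughout: the side names and the hidden weights are invented stand-ins for whatever the physical eraser would actually do, and the observer is imagined not to know them.

```python
import random


def estimate_side_probabilities(toss, n_tosses: int) -> dict:
    """Toss n_tosses times and return the observed relative frequencies."""
    counts = {}
    for _ in range(n_tosses):
        side = toss()
        counts[side] = counts.get(side, 0) + 1
    return {side: count / n_tosses for side, count in counts.items()}


# Invented weights standing in for the unknown behavior of an oblong eraser.
HIDDEN_WEIGHTS = {"broad face A": 0.45, "broad face B": 0.45,
                  "long edge": 0.08, "end": 0.02}
rng = random.Random(0)  # fixed seed so the sketch is repeatable


def toss_eraser() -> str:
    """One simulated toss; the observer sees only the side that lands up."""
    sides = list(HIDDEN_WEIGHTS)
    weights = list(HIDDEN_WEIGHTS.values())
    return rng.choices(sides, weights=weights)[0]


estimates = estimate_side_probabilities(toss_eraser, 50_000)

if __name__ == "__main__":
    for side, share in sorted(estimates.items()):
        print(f"{side}: {share:.3f}")
```

With enough tosses the recorded shares come close to the hidden weights, which is all an empirical probability claims to deliver.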

In fact, the critics of the a priori school of thought contend that the
belief in any a priori judgment or pure reason is a form of self-deception.
The judgment that the two sides of a coin, or six faces of a die, are
equiprobable rests on the actual experience of previous tosses as practiced by
countless youths in every generation. Hence, judgments on heads and
tails are not at all a priori, but rather based on personal experience, and
even traditional expectations. We are never tempted to estimate the
probability of death on an a priori basis, since the simple conditions which
surround the tossing of pennies and dice do not prevail in vital statistics.
In general, the a priori method of fixing probabilities becomes wholly
inadequate after we leave the very limited realm of games of chance.

* Some authors prefer the phrase “insufficient reason,” which they feel is more appropriate.

Empirical Probabilities. If weights are assigned to possibilities on the basis
of empirical observations, they are known as empirical probabilities. As
a simple practical example of the calculation of empirical probabilities we
present an abridged life table of the United States (1950) and of eighteenth
century Berlin (Table 8.1.1).

Table 8.1.1 Abridged Life Table, U.S. 1950 and Germany c. 1750

          Number Surviving at Specified Ages
Age          U.S.        Germany

  0        100,000       100,000
 10         96,177        51,000
 20         95,366        49,600
 50         86,591        31,300
 60         75,921        22,600

Sources: Johann P. Süssmilch, Die Göttliche Ordnung, 2d edition, 1762, Vol. II,
pp. 319-323 (Adapted); and U.S. National Office of Vital Statistics, Vital
Statistics — Special Reports, Life Tables for 1949-51, Vol. 41, No. 1, U.S.
Government Printing Office, Washington, D.C., 1954, Table 2.

The purpose of this record is to set up the
probabilities of death for various age groups. The elementary outcomes
are, of course, life and death in a given year. The weights of these two
possibilities are based on the average number of deaths over a reasonable
period of observation. The average of the past thereby becomes the best
estimate for the probability of the future. Thus if, out of a cohort of
100,000 United States births, approximately 87,000 have survived to the
age of 50, we may conclude that in the future, as well, anyone in a similar
cohort would have an 87 per cent probability of surviving to that age.
In eighteenth century Berlin, the corresponding person would have had
only a 31 per cent probability of surviving to age 50. A twenty-year-old
youth in the United States would have an 80 per cent probability
(75,921/95,366) of reaching his sixtieth birthday. This is, of course, the
type of empirical actuarial data on which life insurance premiums are computed.
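The survival probabilities quoted from the life table are simple ratios of survivors, and may be sketched as follows. The figures are those of Table 8.1.1; the U.S. entry at age 50 is taken here as 86,591, an assumption chosen to agree with the text's “approximately 87,000.” The function name is illustrative only.

```python
# Survivors out of 100,000 births at selected ages (Table 8.1.1).
# Assumption: the U.S. figure at age 50 is read as 86,591, in line with
# the "approximately 87,000" quoted in the text.
US_1950 = {0: 100_000, 10: 96_177, 20: 95_366, 50: 86_591, 60: 75_921}
GERMANY_1750 = {0: 100_000, 10: 51_000, 20: 49_600, 50: 31_300, 60: 22_600}


def survival_probability(table: dict, from_age: int, to_age: int) -> float:
    """Empirical probability that a person alive at from_age reaches to_age."""
    return table[to_age] / table[from_age]


if __name__ == "__main__":
    print(round(survival_probability(US_1950, 0, 50), 2))        # birth to 50
    print(round(survival_probability(GERMANY_1750, 0, 50), 2))   # birth to 50
    print(round(survival_probability(US_1950, 20, 60), 2))       # 20 to 60
```

Dividing by the survivors at the starting age, rather than by the original 100,000, is what makes the last figure a probability for someone who has already reached age 20.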



later find it profitable to bet constantly on tails, and be right
more than half of the time. In this case, he has rejected the initial
probability statement and established a new one consistent with the observed
frequencies. Such rejection is known as a statistical test.

The same testing procedure is applied to empirical probabilities. In
the case of life tables, a certain chance variation is permitted without
casting doubt on the reliability of the empirically predicted death rates.
But when the variation becomes too great, the actuarial tables are revised,
and a new norm or model is set up for subsequent observations.

From the foregoing discussion, it is obvious that the determination of
the possibilities and their respective weightings is the core of probability
calculation. The most elementary rules of such calculation will now be
presented.


Questions

1. Define the following concepts:
Probability
Chance
Determining Factor
Possibility Set
Outcome
Event
A Priori Probability
Empirical Probability
Principle of Sufficient Reason
Expected Frequency
Law of Large Numbers
Equiprobability


Problems


ita toUo.i.E op«t»: .ubj^tin-, iUteUrt rthtivoCa. 


2. DUcun probability in 

3. Ik, pooMe ekmce I.rto" to oOTpalion; ot of ol 

residence. 

4 0» . Tn.,^r.L.. » P'' o[ ,h, q»e.Uor4 

erne «hich ol these eltenisUve. is ihot« pkioible. 

. — ,. I .r.- v..^W7 IMS. Stated that 7 of S2 past presidents 

hL' i'S iL"l5co'.”nVoco, ITca IbUS 

raeccod Prnideot Traram io cfUa,. Commoot oo Ibo .-.Ldity ol to ,!»«- 

6. II S po, ooo, ol to .todoof io . Uteo •" ''.'‘•’“"f 

obsenitioo lodicto .boot to pnibrfoUtr o! fiodotj . klhbudol o. 

another class? Discuss. 




ments. These statements must be regarded as models, idealized on
infinity, with which the subsequently observed finite ratios can only
occasionally agree. We will only rarely observe exactly 50 heads on 100 tosses
of even a perfect penny, although the probability of heads is quoted as
50-50. Such discrepancies between expected and observed frequencies
will result either from (1) continued operation of chance factors which
introduce an indefinite amount of variation, or from (2) the operation of
changing conditions which make the initial statement of probability
invalid. These discrepancies ultimately lead to one of the central issues of
inferential statistics (decision-making, Chapter 12), which at this time may
be given only preliminary mention: how may the discrepancy between
expected and observed be interpreted — as chance variation around the
stated probability value, or as evidence against the initial statement of
probability?
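How rarely exactly 50 heads turn up in 100 tosses can be put in figures. The sketch below borrows the standard binomial formula, which this section does not derive, and assumes a perfectly fair coin with independent tosses; the function name is illustrative.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float = 0.5) -> float:
    """Probability of exactly k successes in n independent trials.

    Assumption: trials are independent with constant success probability p
    (the standard binomial model, not derived in this section).
    """
    return comb(n, k) * p**k * (1 - p) ** (n - k)


# Chance of exactly 50 heads in 100 tosses of a fair coin.
p_exactly_50 = binomial_pmf(50, 100)

# Chance that the count falls anywhere from 45 to 55 heads.
p_within_5 = sum(binomial_pmf(k, 100) for k in range(45, 56))

if __name__ == "__main__":
    print(f"exactly 50 heads: {p_exactly_50:.4f}")
    print(f"45 to 55 heads:   {p_within_5:.4f}")
```

Under these assumptions an exact 50-50 split occurs in fewer than one series in ten, while moderate departures from it are the usual result.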

Let us suppose that, in a large number of tosses of the same coin, a
gambler observes heads 40 per cent of the time, instead of the a priori
50 per cent. At this point in the game, how should he react on the next
bet? He must decide whether to accept the discrepancy as mere chance
variation to be ignored, or reject the initial probability statement as false
and revise his expectation.

His first line of reasoning might be: the law of large numbers prescribes
that with an increasing number of tosses, the approximation to the hypo- 
thetical true proportion must become correspondingly closer. The “defi- 
cit” of heads must have resulted from excessive runs of tails. Hence, 
sooner or later the heads will have to “catch up” in order to make up the 
deficit. Therefore, "I will now bet on heads.” However, such a judgment 
is inconsistent with the usual assumption in penny tossing that each out- 
come is independent — that is, uninfluenced by previous ones. The coin has 
no “memory” of previous outcomes, and therefore heads and tails are 
still equiprobable on the next toss. After all, the 50-50 division of events 
holds good only for an infinitely large number of throws, or trials. Any 
finite number of trials is not enough to equalize the outcomes. Hence, the 
deficit need actually never be made up. Therefore, in spite of the unusual 
runs of tails the bettor should not change the pattern of his strategy. He 
has accepted the discrepancy as due to the play of chance factors. 
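The claim that the coin has no “memory” can be examined by simulation. The following is a rough sketch, assuming a fair coin simulated with Python's pseudo-random generator; the function name, the run length of five, and the seed are arbitrary choices made for the illustration.

```python
import random


def heads_share_after_tail_runs(n_tosses: int, run_length: int,
                                seed: int = 0) -> float:
    """Share of heads among tosses that immediately follow a run of tails.

    Assumption: a fair coin with independent tosses, simulated by a
    seeded pseudo-random generator.
    """
    rng = random.Random(seed)
    tosses = [rng.random() < 0.5 for _ in range(n_tosses)]  # True = head
    followers = [
        tosses[i]
        for i in range(run_length, n_tosses)
        if not any(tosses[i - run_length:i])  # previous tosses all tails
    ]
    return sum(followers) / len(followers)


if __name__ == "__main__":
    share = heads_share_after_tail_runs(200_000, run_length=5)
    print(f"heads after five straight tails: {share:.3f}")
```

If heads were “due” after a run of tails, the share would sit well above one-half; in the simulation it stays close to one-half, as independence requires.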

On the other hand, if the run of tails persists beyond the limits of
reasonable tolerance, the player might well raise the question whether the
orthodox hypothesis of equiprobability was valid in the first place.
Perhaps the coin was not symmetrical after all; perhaps the coin favors the
tail, as the empirical evidence seems to indicate. In other words, if the
discrepancy between expected and observed frequencies becomes too great,
we should question the hypothesis, and reject it accordingly. Although
there is no fixed rule which could guide the observer in determining at
what point the length of the run becomes suspicious, he would sooner or

arbitrarily defined as “success”; if the probability of survival is in
question, then life would be considered “success.” An unlimited number of
repeated or concurrent trials is essential to the concept of probability.
Conventionally, we speak of such an infinity of trials as conceptually
repeatable trials, or experiments, since infinity can exist only in the
imagination.


Elementary Rules of Calculation. While all probability calculations are
fundamentally alike, they differ in detail according to whether the event
under consideration is (1) a simple event, (2) a joint event, or (3) an
alternative event.

(1) Simple Event. A simple event is the lowest unit of observation
defined for the data under analysis. It may be the head of a coin, cancer
as a cause of death among the 200 standard causes, a multiple birth, a
successful parole, a divorce, or a hit in baseball.

The elementary nature of the event is not intrinsic but is rather a matter
of definition to be decided by the person doing the analysis. Thus,
cancer could be profitably decomposed into a number of types
according to the site of the affliction; a multiple birth may be subdivided
into twins, triplets, quadruplets, and quintuplets. Any simple outcome
could conceivably be broken into finer classification if the problem
so dictated. Therefore, a simple event is one that is so
considered by the observer. To calculate the probability of a simple event,
we apply:

Rule 1. The probability of a simple event is the ratio of the frequency
of that event to the total frequency of all possible events in the
set.

For example, in an average year, there are approximately 4,000,000 births
in the United States, of which approximately 46,000 (n) are multiple.
Therefore, the probability of a multiple birth is:

$$\Pr(E) = \frac{46{,}000}{46{,}000 + 3{,}954{,}000} = \frac{46{,}000}{4{,}000{,}000} \approx \frac{1}{87}$$

This ratio, a partial statement ... amazingly representative of the whole ...
consists of innumerable trials.
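Rule 1 amounts to a single division, sketched below with exact fractions. The function name is invented for the illustration; `n` is the frequency of the event and `m` the frequency of all other events in the set, and the birth figures are the rounded ones used in the example above.

```python
from fractions import Fraction


def simple_event_probability(n: int, m: int) -> Fraction:
    """Rule 1: frequency of the event over the frequency of all events.

    n is the frequency of the given event; m is the frequency of all
    other events in the possibility set.
    """
    return Fraction(n, n + m)


# Multiple births (n) against single births (m) in an average year.
p_multiple_birth = simple_event_probability(46_000, 3_954_000)

if __name__ == "__main__":
    print(p_multiple_birth)               # exact ratio in lowest terms
    print(float(p_multiple_birth))        # decimal form
    print(round(1 / p_multiple_birth))    # "1 in about ..." form
```

Working with `Fraction` keeps the ratio exact, so the familiar “1 in 87” appears as a rounding of the underlying frequencies rather than as a separate fact.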

When the frequencies of a set of simple events are identical, Rule 1
reduces to:





7. It was reported that 15,000 intoxicated pedestrians were killed by cars, and
only 5,000 intoxicated drivers killed. This means that the probability
of a pedestrian being killed is three times the probability of a driver. Comment.

8. Comment on the statement: “Improbable events are extremely common.”

9. Does the principle of chance apply to such “certain outcomes” as eclipses, the
rising sun, and other events which are practically certain to occur? Comment.

10. A soldier jumped into a foxhole which had just been hit by a shell. He
reasoned that a shell will not, by chance, strike the same spot again. Criticize.
(Hint: Is the chance hypothesis the only possible hypothesis?)

11. Comment on the cliché that “lightning does not strike twice in the same spot.”

Section Two
Probability Calculation

Probability as a Frequency Ratio. To ask “What is the probability of death
from cancer?” is essentially the same as asking “What fraction of all
deaths are caused by cancer?” Although a competent demographer would
wish to specify the cohort of population to which the question referred,
this would not change the fundamental nature of probability as a frequency
ratio. Hence, to answer the above question we would have to express the
frequency of death by cancer as a proportion of deaths by all possible
causes, including cancer. About 20 per cent of all deaths now occurring
in the United States are caused by cancer; therefore, the probability or
expectation of death by cancer among subsequent deaths would likewise
be 20 per cent. This experience ratio of 1 in 5 trials is used to forecast
the events of the long run, on the assumption that the conditions of death
remain constant. Such a ratio is conventionally expressed:

$$\Pr(E) = \frac{n}{n + m} \qquad (8.2.1)$$

in which Pr(E) = the probability of a given event, E
n = the frequency of occurrence of the given event
m = the frequency of all other events in the set

Definition of a Trial. The concept of a trial does not necessarily limit
itself to a contrived act, such as the tossing of a coin, or the trial of a
batter who may or may not get a hit. It may also refer to the interrogatory
attitude of the observer who awaits the outcome of a natural situation
which may eventuate in one of several ways. Thus, the life or death of
a given member of a population cohort is the outcome of a trial in a
statistical sense. If the probability of death is to be calculated, then death is

Rule 3. The probability of two or more events occurring together, or a
joint event, is the product of the probabilities of the individual
events.

Thus, the probability of two fives is:

$$\Pr(5 \text{ and } 5) = \Pr(5) \times \Pr(5) = \frac{1}{6} \times \frac{1}{6} = \frac{1}{36}$$

(3) Alternative Event. We may wish to broaden the definition of success
by regarding any one of a number of alternative outcomes as constituting
success, such as any one of four aces. We may wish not to "win," but to
win, place, or show. Statisticians usually think of these alternatives as
constituting a subset of the possibility set. By increasing the number of
acceptable possibilities in the subset, we necessarily increase the
probability of success, since we add the probabilities of the respective
outcomes. This principle is expressed in the addition theorem:

Rule 4. The probability that one of two or more alternative events will
occur is the sum of the probabilities of the individual events.

For example, by this theorem the probability of any one of the four aces
would be expressed:

$$\Pr(A_1 \text{ or } A_2 \text{ or } A_3 \text{ or } A_4) = \frac{1}{52} + \frac{1}{52} + \frac{1}{52} + \frac{1}{52} = \frac{4}{52}$$


The addition theorem may be applied not only to simple events, such as
aces in a deck of cards, but also to joint events. Thus, in the toss of two
dice previously alluded to, there are six ways of obtaining a 7, namely
(1 + 6), (2 + 5), (3 + 4), (4 + 3), (5 + 2), (6 + 1), and two ways of
obtaining 11, namely (6 + 5) and (5 + 6), on a single throw of a pair
of dice. By simple addition, we learn that the probability of a 7 is 6/36
and the probability of an 11 is 2/36. Therefore, the probability of
tossing either a seven or eleven would be 8 out of 36.
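The count of ways for a 7 and an 11 can be checked by enumeration. The following is a minimal sketch, assuming two fair dice; the helper name `pr_sum` is ours, not the authors'.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely joint outcomes of two dice, as (die 1, die 2).
outcomes = list(product(range(1, 7), repeat=2))


def pr_sum(total: int) -> Fraction:
    """Probability that the two faces sum to `total`."""
    ways = sum(1 for a, b in outcomes if a + b == total)
    return Fraction(ways, len(outcomes))


pr_7 = pr_sum(7)            # 6 ways out of 36
pr_11 = pr_sum(11)          # 2 ways out of 36
pr_7_or_11 = pr_7 + pr_11   # Rule 4: a 7 and an 11 cannot occur on the same throw
print(pr_7, pr_11, pr_7_or_11)  # 1/6 1/18 2/9
```

The printed fractions are the reduced forms of 6/36, 2/36, and 8/36.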

Outcomes Not Mutually Exclusive. Rule 4 holds only for outcomes that are
mutually exclusive, that is, for outcomes that cannot occur together on
a single trial. Thus, a head and a tail cannot both appear when a
coin is tossed; a birth cannot be single and multiple at the same time.
Yet, in many instances, a given object may be classified in
more than one way: a playing card may be read simultaneously as an
ace and a spade; an individual may be characterized as both [...] and
Catholic. Such traits are not at all mutually exclusive since they rep-




Rule 2. The probability of a given event in a set of equally likely events
is the ratio of 1 to the total number of possibilities, N.

That is,

$$\Pr(E) = \frac{1}{N} \tag{8.2.2}$$


For instance, in the toss of a coin there are two possible outcomes, assumed
to be equally numerous in the long run; hence, the probability of a head is:

$$\Pr(\text{head}) = \frac{1}{2}$$


(2) Joint Event. Events do not necessarily occur singly; two or more
distinguishable events may occur together to form a joint event, sometimes
labeled a compound event. The toss of two dice, for example, will always
result in concurrent events such as 1 and 1, 1 and 2, ..., 6 and 6, represent-
ing the faces of the dice.

If, according to Rule 2, the probability of a 5 on Die 1 is 1/6, and the prob-
ability of a 5 on Die 2 is likewise 1/6, then the probability of their occurring
together would be 1/36. This result may be clarified visually by means of
the accompanying diagram which deploys the 36 possible ways in which
the two dice may fall together, that is, the 36 joint outcomes. While
there is only one way in which the two fives may fall together to give a
probability of 1/36, a 2 and 3 may also occur as 3 and 2 (in two ways)
for a probability of 2/36. The probabilities of other joint outcomes may
similarly be read from this diagram.
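In place of the diagram, the 36 joint outcomes can be listed by machine. This is a hedged sketch assuming two fair, distinguishable dice; the variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)
joint = list(product(faces, faces))  # 36 ordered pairs (die 1, die 2)

# Two fives can fall together in only one way.
pr_five_five = Fraction(joint.count((5, 5)), len(joint))

# A 2 and a 3 can fall as (2, 3) or as (3, 2).
ways_two_three = sum(1 for pair in joint if sorted(pair) == [2, 3])
pr_two_three = Fraction(ways_two_three, len(joint))

# Product of the two single-die probabilities, for comparison with the listing.
product_rule = Fraction(1, 6) * Fraction(1, 6)
print(pr_five_five, pr_two_three, product_rule)  # 1/36 1/18 1/36
```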



This cumbersome procedure of listing every possible way is not nec-
essary in practice; instead we always apply the product theorem, as ex-
pressed in:



Thus, in computing the probability of drawing two aces in succession
from a bridge deck of 52 cards, the probability of drawing the first ace
is, of course, 4/52. But the drawing of one ace, without replacement,
obviously affects the probability of drawing another ace on the next draw.
There being only 51 cards remaining (including three aces), the proba-
bility of the next ace is now 3/51. The joint probability is the product
of the individual probabilities: 4/52 × 3/51 = 12/2,652. In
the long run, out of 2,652 trials consisting of two random draws without
replacement, one would expect 12 outcomes of double aces.
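The two-ace calculation can be verified by brute force over every ordered pair of cards. This is a sketch under the assumption of a 52-card deck with four aces; the deck encoding is ours.

```python
from fractions import Fraction
from itertools import permutations

pr_first_ace = Fraction(4, 52)
pr_second_ace = Fraction(3, 51)   # revised: one ace and one card are already gone
pr_two_aces = pr_first_ace * pr_second_ace

# Brute-force check: every ordered draw of two different cards.
deck = ["A"] * 4 + ["x"] * 48     # "x" stands for any non-ace
draws = list(permutations(range(52), 2))
double_aces = sum(1 for i, j in draws if deck[i] == "A" and deck[j] == "A")

print(len(draws), double_aces, pr_two_aces)  # 2652 12 1/221
```

The reduced fraction 1/221 is the same quantity as 12/2,652.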

This illustration, which is convenient because of its simple manipula-
tion, is a formal, routine demonstration of a familiar principle. However,
in the practical affairs of social life, to say nothing of [...],
it is not always so easy to detect the presence of dependence, and still
more difficult to measure its degree. Dependence is [...], as are all
social events, and accurate calculation is [...].
Consequently, we are frequently obliged to [...]
dependence, and forgo its calculation, since the needed empirical data
are [...].

For example, actuaries [...] the probability of a married couple sur-
viving to celebrate their golden wedding. If the bride is 21
and the groom 26, they would have to live at least to the ages of 71 and
76, respectively. Now, we know from the life tables for the
United States that the 26-year-old male has [...] chances of sur-
viving to that age; the 21-year-old female has [...] chances
to survive for fifty years after marriage. By applying the product
theorem, we find that the probability of both reaching the required ages
would be [...] × [...] = 1/5, or .20. But this result rests on the
assumption that the survivals of the two persons are inde-
pendent. It seems incredible, however, that spouses are chosen
at random. Since it is plausible to believe that healthy persons tend to
marry healthy mates, in that respect there are selective factors in opera-
tion, as well as chance factors. Moreover, spouses
share a common way of life, and the health of one is likely to influence the
health of the other. Hence, their survivals are not as completely inde-
pendent as the above probabilistic calculation requires. Far from being
independent, there is probably a rather high correlation between the
survival rates of married spouses. These probabilities were
obtained from the general life tables, simply because [...]

come of A in the joint event. The reason for [omitting this] traditional clause is
simply that the multiplication rule for joint probabilities holds
both for independent and dependent probabilities. If events are lacking in in-
dependence, then the probabilities must be duly [adjusted to] allow for the degree of
dependency; otherwise, the product theorem [will yield] invalid results. Never-
theless, the product theorem does apply to [such] probabilities, provided of
course they have been properly computed.




resent different dimensions of the same object. It would therefore be 
erroneous to apply the simple addition theorem, which holds only for 
mutually exclusive events, to events which may occur together. Thus, 
to calculate the probability of an ace or a spade, we must not add the 
probabilities of these simple events and let it go at that. The probability 
of an ace or a spade is not:

$$\frac{4}{52} + \frac{13}{52} = \frac{17}{52}$$

This probability is too high, since we are erroneously combining two
non-exclusive traits. When outcomes that are not mutually exclusive
are produced, it is obvious that the alternative outcomes will, in a cer-
tain proportion of cases, occur together. Thus, the alternative events,
"ace" or "spade," will occasionally both turn up together as the "ace
of spades" and must be accepted as a success.

However, in adhering to the simple addition theorem (Rule 4), the
ace of spades will mistakenly occur twice, whereas every other event will
occur only once. But theoretically, the ace of spades should occur only
as often as every other card, once in 52 times. Although the ace of
spades is accepted as a success, we are not permitted to count it twice as
often as its random appearance would justify. The foregoing addition
rule must therefore be amended by subtracting the duplicate "occur-
rence," which is measured by its joint probability, or 1/52. The proce-
dure just described is now generalized as:

Rule 5. The probability that one of two alternative events, not mutually
exclusive, will occur is the sum of the individual probabilities,
minus their joint probability.

According to this rule, the probability of an ace or spade would be:

$$\frac{4}{52} + \frac{13}{52} - \frac{1}{52} = \frac{16}{52}$$
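Rule 5 can be checked against a direct count over a deck. The sketch below assumes a standard 52-card deck; the rank and suit labels and the helper `pr` are illustrative names only.

```python
from fractions import Fraction

ranks = ["A"] + [str(n) for n in range(2, 11)] + ["J", "Q", "K"]
suits = ["spades", "hearts", "diamonds", "clubs"]
deck = [(rank, suit) for rank in ranks for suit in suits]  # 52 cards


def pr(event) -> Fraction:
    """Relative frequency of cards in the deck that satisfy `event`."""
    return Fraction(sum(1 for card in deck if event(card)), len(deck))


pr_ace = pr(lambda c: c[0] == "A")                         # 4/52
pr_spade = pr(lambda c: c[1] == "spades")                  # 13/52
pr_both = pr(lambda c: c[0] == "A" and c[1] == "spades")   # 1/52

by_rule_5 = pr_ace + pr_spade - pr_both
by_counting = pr(lambda c: c[0] == "A" or c[1] == "spades")
print(by_rule_5, by_counting)  # 4/13 4/13
```

Both routes give 16/52 (reduced to 4/13), whereas the unamended sum would give 17/52.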

Independent and Dependent Outcomes. By independence is meant that
a given event or outcome is uninfluenced by the preceding or concurrent
events. Thus, the probability of a head on a given trial is independent
of the previous falls, whether head or tail. The probability of the next
outcome has not changed because of the previous event.

However, when the probability of one of the outcomes in a joint event
is in any way conditioned by the previous outcome, we say that the
second is dependent on the previous event. In that case, a revised proba-
bility of the second event must be computed to take account of that
degree of dependence before the multiplication rule can be invoked. *

* The above formulation of Rule 3 omits mention of the usual provision that probabilities
must be independent, i.e., that the outcome of B is in no way affected by the out-



[...] With all his uniqueness, he may still be considered a member of a
class of cases representing a large number of similar historical [...].
The question should therefore be rephrased to read: What is the prob-
ability of persons like Caesar, under similar circumstances, [...]
to Britain? Although these forces do not lend themselves to [...]
measurement, an intuitive but profitable approximation may
be obtained. Many important human decisions, [...]
of scientific hypotheses and legal judgments, [...] on [...]
rather than on methodical quantitative procedures. [...]
would reserve the concept "likely" for such [...]
and employ the concept "probable" only when [...] on
rigorous quantitative observations.

A third possible interpretation of the [...] holds that a
single outcome can never set up, [...] established or confirmed
only by the results of a large number of trials.
The reason for this is simply that statistics do [...]
to "tell us anything about the single [...]" [...]
would not be sufficient to disprove the [...] probability of 80
per cent success, established by [...], since 20
per cent of the cases are confidently [...] to be failures. It is only
after a large number of trials, [...] constant factors
have had an opportunity to manifest themselves, that the validity of a
probability statement can be made plausible, [...] proved. We
would question the hypothesis of a true coin after a run of [...]
heads, and certainly reject it after a run of perhaps twenty-five.

Questions and Problems 

1. Define the following concepts:

Trial
Simple Event
Joint Event
Alternative Event
Addition Theorem
Product Theorem
Dependent Event
Independent Event

2. According to the International List, there are [...] causes of death, of which
cancer is one; hence, a person has a probability of [...] of dying of that disease.
Comment.

3. In a group of 6 persons, there are 4 boys and 2 girls. If you selected two in-
dividuals by chance, what would be the probability of selecting a boy first and
a girl second? a girl first and a boy second?

4. For the following distribution of [...],
calculate the probability of mixed marriages and of homogamous marriages,




were not available. Weighting for subsets by marital state, occupations,
and other relevant classifications are not provided. Hence, strictly speak-
ing, the joint probability of 1/5 is an over- or under-statement depending
on the health of the cohort. Only if men and women were randomly
paired would the joint probability of .20 be valid. But even if the depend-
ent probabilities could be as easily calculated as in the case of the two
aces, it is not likely that a refinement of these calculations would greatly
influence the romantic mating process.


The Probability of a Single Event. Since the concept of probability assumes
innumerable repeated trials, which give the law of large numbers enough
room to operate in, it is quite conventionally asserted that we cannot assign
a probability to the outcome of a single trial. This apparent truism is not
as simple as it appears. There are three possible reactions to this ambigu-
ous statement.


In the first place, we do act on probabilities, and we commonly do
apply them to single trials. Thus, "in the clutch," a pinch hitter is selected
on the differential probability of his getting a hit. The manager will reason
as follows: "Jones has a batting average of .400 (empirical probability);
if I send him up to bat, he is more likely to succeed than Smith, who has
only a .250 average." Analogously, a person who is ill calculates his own
private chances for recovery and makes plans accordingly, however in-
different the insurance company may be to the identity of the individual
in its actuarial mass statistics. Thus, each person continually codifies
his experiences and probabilistically prepares for the alternative outcomes
in each single instance.

In the second place, a single case may be thought of as a wholly unique
event. With this interpretation, it would seem doubly clear that no prob-
ability statement could be made, since there can be no long run of unique
events or, for that matter, no run at all. Being unique, the event could
[...] and obviously could not occur again. A
[...] would be both impossible and useless. For
[...], if Caesar had tossed one of his coins, we could properly
and readily state the probability of his having turned up his image thereon.
[...] precision estimate the probability of his
having [...]. Nevertheless, historians are constantly speculat-
ing [...] "unique" actions of Caesar and many other per-
sonages [...]. [...] can the statistical principle forbidding such
thinking [...] and useful deductions of [...]

[...] sociologically speaking, a unique
event is [...]. Although there was only one Julius Caesar,
he still [...] other leaders of men, not only in
his broad characteristics, but also in the potentialities of a trip across the




This count will depend on whether we focus on the membership of the
group or the order in which the component members are placed, that is,
on whether we are dealing with a combination or permutation. Statistically
speaking, combination denotes the identity of the individuals that compose
a group, while permutation denotes the specific order, or arrangement, in
which the members may be placed.

In order to determine the number of combinations or permutations in
any given set, we could of course laboriously enumerate all possible ways,
as has been done for the tosses of two dice (Section 2). But in practice,
there are more efficient methods by which this result may be accomplished.
To these we now turn.

Combinations. Any group or set of distinct objects is called a combination,
symbolized C. Such a joint event may consist of a list of digits, a handful
of coins, a group of nine baseball players, a series of births, a set of postage
stamps, a bridge hand, a list of True-False questions, or a sample of a human
population. The substitution, addition, or subtraction of even a single
item in the given set produces a different combination. Thus, the two
sets of digits, 2698 and 2697, constitute two different combinations. The
items must be distinct, but need not be distinguishable, as in 4444 and
44444, which are different combinations. For a given problem, a set of
five pennies may be considered as indistinguishable, although each bears a
different date, a fact which may be, however, important to a numismatist.

Permutations. The above illustrations make it evident that in many situa-
tions we are interested not only in the identity of the combinatorial ele-
ments, but also in the order of their arrangement. A telephone number
of 2698 is not identical with 2986; the same baseball nine may be put
forth in different batting orders; five classmates may be chosen in differ-
ent orders of sociometric preference; the pins in a Yale lock may be placed
in different orders so that only the corresponding key will fit it. In the
above cases the number and identity of the items remain unaltered;
they remain the same combination, but they take on different permuta-
tions because two or more items in the arrangement are re-ordered. When
the focus of attention is on the order of things, the series is a permutation,
symbolized by P.

Popular language does not always correctly distinguish these two
terms. In order to open a bank vault lock, consisting of gears and pins
which must be actuated in a specific sequence, the operator must know,
strictly speaking, not merely the combination, but also the permutation
of the dial numbers. A rearrangement of any two (or more) of the pins
would produce another permutation and make it impossible to open it.

The Calculation of Permutations. Any or all of the objects in a combina-
tion may be permuted, provided the objects are distinguishable. This


assuming that all marry and that religion of husband and wife are independ-
ent.

              Female   Male

Protestant      20      30
Jewish          60      50
Total           80      80


5. Given a group of 100 persons, of whom 20 are Republicans and 80 Democrats.
If 5 are selected by chance, what would be the probability that all would be
Republican?

6. In a large number of throws of two true dice, how often would you expect a
total of seven?

7. If two dice were repeatedly thrown, which event would surprise you more:
a double-six on the first throw, or on any one of the subsequent throws?

8. Comment on the statement: "All individuals in the group are by definition
subject to the same probability statement; hence, the individual, in the
cohort for which the prediction is made, cannot justifiably exempt himself
from the operation of the quoted probability."

9. If the Income Tax (Internal Revenue) Office selects a 25 per cent random
sample of the returns for checking, what would be the probability of a person
being selected for checking two years in succession? Comment.

10. You have bought 10 chances, at $[...] each, on an automobile which is being
raffled for charity. Suppose you wished to sell 5 of these raffle tickets. What
would be a fair price for a person not interested in charity? Discuss.


Section Three 

Combinations and Permutations 

A joint event is, by definition, a grouping of individual events. It could
comprise the letters in a word, a bridge foursome, or a group of musical
notes. But groups can usually be assembled in more than one way. Thus,
there are 36 ways in which two dice may fall; a 3 and a 5 may occur in
two ways since each figure can appear on either one of two dice, as has
been previously shown. Similarly, letters can be scrambled (arm, ram,
and mar); four persons may be seated in various orders around a table;
the same musical tones may take various positions in a chord or in a
melodic sequence. A facility in such regrouping is of prime importance
in the calculation of probabilities, because the magnitude of a joint prob-
ability is necessarily increased in proportion to the number of ways in
which a specified joint event can occur.


ways. If all ten digits (0-9) were available, the computation would be
10 × 9 × 8 × 7 × 6 × 5 × 4 × 3 × 2 × 1 = 3,628,800. The principle here
involved is referred to as the Multiplication Theorem, the generalized
formulation of which is:

If an event can occur in $N_1$ ways, and thereafter in $N_2$ ways, and so on,
these successive events can occur in that order in
$N_1 \times N_2 \times \cdots \times N_n$ ways.

This theorem is employed in both of the fundamental patterns of permu-
tation: (1) complete, and (2) partial.

Complete Permutation, or Factorial. A complete permutation consists
in permuting all objects in a set, all at a time: that is, N objects, N at
a time. The number of permutations resulting therefrom is called the
factorial of the number, and is symbolized:

$$P_N^N = N! \tag{8.3.1}$$

In permuting four distinguishable objects, 4! would cumulate to 24 differ-
ent orders, as has already been illustrated. A factorial of a number (N)
is therefore defined as the product of all integers from N to 1, inclusive.
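The Multiplication Theorem and the factorial can be sketched in a few lines. The function name `ways_in_sequence` is an assumption introduced for illustration; `math.prod` and `math.factorial` are standard-library calls (Python 3.8 or later).

```python
import math


def ways_in_sequence(*counts: int) -> int:
    """Multiplication Theorem: N1 x N2 x ... ways for successive stages."""
    return math.prod(counts)


# Four distinguishable objects, all at a time: 4 x 3 x 2 x 1.
print(ways_in_sequence(4, 3, 2, 1))             # 24
# All ten digits, all at a time: 10 x 9 x ... x 1.
print(ways_in_sequence(*range(10, 0, -1)))      # 3628800
print(math.factorial(10))                       # 3628800
```

The factorial is simply the special case in which the stage counts run from N down to 1.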

Partial Permutation. A permutation of N distinguishable objects, less
than N at a time, may be called partial permutation, symbolized by
$P_r^N$, to be read "the number of permutations of N objects, r at a time."
This formula is a directive to carry out the permutations only through
r stages, instead of through the total N.

$$P_r^N = N(N - 1)(N - 2) \cdots (N - r + 1) \tag{8.3.2}$$

For example, the number of four-digit telephone numbers, when all ten
digits are available (but when the same digit is allowed to appear only
once in any given telephone number), would be derived as follows:

$$P_4^{10} = 10 \times 9 \times 8 \times 7 = 5{,}040$$
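Formula 8.3.2 amounts to stopping the factorial product after r stages. The following sketch uses a hypothetical helper name and cross-checks it against the standard library's `math.perm` and an explicit listing.

```python
import math
from itertools import permutations


def partial_permutations(n: int, r: int) -> int:
    """N(N - 1)(N - 2)...(N - r + 1): carry the product through r stages only."""
    result = 1
    for k in range(n, n - r, -1):
        result *= k
    return result


four_digit_numbers = partial_permutations(10, 4)
listed = sum(1 for _ in permutations("0123456789", 4))  # brute-force listing
print(four_digit_numbers, math.perm(10, 4), listed)     # 5040 5040 5040
```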

Recurring Items. Combinations which contain indistinguishable items
are amenable to permutation only in proportion to the number of dis-
tinguishable objects. Nevertheless, there are certain combinations in
which the recurrence of items is not only permitted, but even required,
as, for instance, in the case of telephone numbers and computer punch
cards.

In the case of discrete physical objects, such recurrences are obviously
impossible. In permuting nine boys on a baseball field, physical circum-
stance precludes their recurrence in successive positions on the same
"trial." A boy cannot play first base and center field in the same lineup.
The same principle holds for sociometric arrangements. However, nu-
merical symbols and letters of the alphabet do not suffer from this limi-
tation; hence, if one wishes to determine the total number of all possible



proviso is quite logical since a re-ordering of indistinguishable objects
is meaningless: a rearrangement cannot be recognized insofar as the
objects are considered to be absolutely identical.

If the number of objects in a combination is small enough, the permu-
tations can be formed by inspection and counted. Thus, the three digits,
2, 5, and 6, can be permuted in six ways as follows:

256 562 625 

265 526 652 

The addition of one digit to the set (2568) would not change the number
of potential combinations, but would increase the number of possible
permutations from six to 24, as the tree diagram in Figure 8.3.1 will
demonstrate.

Figure 8.3.1 Tree Diagram of Permutations

But the same result could have been more quickly achieved
without such extensive and detailed figuration by simple multiplication
according to the following logic:

Examination of the foregoing 24 permutations reveals that the first
digit in a permutation could be any one of the four; that is, the first
digit could occur in four ways. After each of these four possibilities,
the next digit could fall in any one of three remaining ways to join each
of the preceding four, yielding so far twelve different permutations, two
digits at a time. Thus, the pattern of calculation could continue until
all the available digits are used up as above. Summarizing the arithmetic
procedure to count the number of ways, we have: 4 × 3 × 2 × 1 = 24
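The tree-diagram listing can be reproduced with the standard library's `itertools.permutations`. This is only a sketch of the enumeration; the variable names are ours.

```python
from itertools import permutations

three_digit = ["".join(p) for p in permutations("256")]
four_digit = ["".join(p) for p in permutations("2568")]

print(three_digit)        # ['256', '265', '526', '562', '625', '652']
print(len(four_digit))    # 24, the same as 4 * 3 * 2 * 1

# After the first two stages there are 4 * 3 = 12 two-digit beginnings.
print(len({p[:2] for p in four_digit}))  # 12
```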

According to this formula, the number of different ways in which we
may select 4 items from a pool of 10 would be:

$$C_4^{10} = \frac{10 \times 9 \times 8 \times 7}{4 \times 3 \times 2 \times 1} = 210 \text{ combinations}$$

These combinatorial methods are of special significance in the calcula-
tion of the binomial probabilities, which is the subject of the following
section.

Questions and Problems 

1. Define the following concepts: 

Combination 
Permutation
Multiplication Theorem 
Recurring Items 
Tree Diagram 

2. A population can be classified by race in 3 ways, by religion in 3 ways, and by 
nativity in 2 ways. Prepare a tree diagram to show in how many ways persons 
may be classified simultaneously. 

3. How many telephone numbers of 4 digits each can be made from all digits 0 to 9
if: (a) no telephone number begins with zero? (b) no duplicate digits are
permitted? (c) all digits are permitted to recur?

4. A recreation director wants each boy on a baseball team to have the opportunity
to play each position for a full game.

(a) How many games must he play?

(b) How many games would he have to play if he used all possible batting orders?

5. In the columns of an IBM card, each labeled 0 to 9 inclusive, how many different
code numbers can be made using 1, 2, or 3 columns?

6. It is usually assumed that additional culture traits (discoveries or inventions)
increase the possibilities of new inventions, since new traits may combine with
any one of a large number of already existing traits. In principle, this is a
statistical approach to the problem of culture growth. Assuming that all traits
may freely combine, how many permutations could be formed with 2, 3, 4, 5, or
6 traits? What generalization does this suggest?

7. What would be the more probable bridge hand, any specified set of 13 cards, or
a complete suit? Explain.

8. In a toss of ten coins, why are ten heads (tails) so much less probable than
any one of the other combinations? (Hint: Calculate $C_r^N$ for N = 10, and all
possible values of r.)




arrangements, with no restriction on recurrences, one must allow the
multiplication theorem to take over from the factorial principle. Whereas
all factorials are based on the multiplication theorem, not all cases in
which this theorem is employed need be factorials, as in the instance of
telephone numbers. The number of possible four-digit telephone numbers,
allowing all ten digits to recur, would be 10 × 10 × 10 × 10 = 10,000.
The symbolism for such an operation is $N^r$, where r is the number of
positions to be filled, in this instance $10^4$. If the N objects are to be
serialized in N positions, the symbol would be $N^N$.
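The contrast between arrangements with and without recurrence can be sketched by enumeration. `itertools.product` lists arrangements in which items may recur and `itertools.permutations` lists those in which they may not; the variable names are illustrative.

```python
from itertools import permutations, product

digits = "0123456789"

# Recurrence allowed: each of the 4 positions may hold any of the 10 digits.
with_recurrence = sum(1 for _ in product(digits, repeat=4))

# Recurrence forbidden: a digit once used is no longer available.
without_recurrence = sum(1 for _ in permutations(digits, 4))

print(with_recurrence, 10 ** 4)   # 10000 10000
print(without_recurrence)         # 5040
```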


Calculation of Combinations, r at a Time. There is no difficulty in the 
calculation of the number of combinations, when N objects are taken N 
at a time, for there can be only one combination. A problem ensues 
only when fewer than N objects are used to form the combinations,
when we take them r at a time. 

As in the case of permutations, when N is small, the subsidiary combina- 
tions can be found by inspection. A group of four children, A, B, C, D, 
taken two at a time, could combine in the following six ways: 

AB BC CD 

AC BD 

AD 


The partial permutations of the same group, two at a time, would total 
to 12. Clearly, there will always be fewer combinations than permuta- 
tions, for any given N and r. 

With larger N's, it is tedious to determine the number of combinations
by means of a tree diagram. Instead it is more convenient to compute
first the number of permutations of N objects, taken r at a time. The
second step consists in clearing the result of these permutations by divid-
ing by r!, which is the number of ways each combination can be permuted.
In other words, since each combination can have r! permutations, we may
obtain the number of combinations by dividing the number of permuta-
tions by r!. This division obviously reduces the permutations to the
corresponding number of combinations. The formula is as follows: *

$$C_r^N = \frac{P_r^N}{r!} \tag{8.3.3}$$

where N = total number of items
      r = number of items in combination
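The two-step procedure (count the partial permutations, then divide by r!) can be sketched as follows. The function name `n_combinations` is hypothetical; `math.perm`, `math.factorial`, and `math.comb` are standard-library calls.

```python
import math
from itertools import combinations, permutations


def n_combinations(n: int, r: int) -> int:
    """Formula 8.3.3: partial permutations divided by r!."""
    return math.perm(n, r) // math.factorial(r)


children = "ABCD"
pairs = ["".join(c) for c in combinations(children, 2)]
ordered_pairs = list(permutations(children, 2))

print(pairs)                      # ['AB', 'AC', 'AD', 'BC', 'BD', 'CD']
print(len(ordered_pairs))         # 12 partial permutations, two at a time
print(n_combinations(4, 2))       # 6
print(n_combinations(10, 4))      # 210
```

Each pair such as AB corresponds to 2! = 2 ordered arrangements (AB and BA), which is why 12 permutations reduce to 6 combinations.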


* In many texts in college algebra, as well as in statistics, the formula is written:

$$C_r^N = \frac{N!}{r!\,(N - r)!}$$

This formulation is obtained by multiplying both members of the fraction by (N − r)!,
thus producing complete permutations in both the numerator and denominator, which
is considered by some to be a simpler procedure.


$$pq = \frac{1}{2} \times \frac{1}{2} = \frac{1}{4}$$

where p = the probability of a male
      q = the probability of a female

But the outcomes may occur in reverse order (the girl on the first
trial and the boy on the second trial), making a total of two ways in
which the specified joint event can occur. Therefore, the probability
of a male birth and a female birth, irrespective of the order of outcomes,
would be

$$\Pr(M, F) = \frac{1}{4} + \frac{1}{4} = \frac{1}{2}$$

Thus, the probability of a specified joint event is the probability of any
given joint event weighted by the number of ways in which it can occur.

Similarly, we may establish the probability of guessing 3 right and 2
wrong on a True-False test of five items. We first find the probability
of 3 right and 2 wrong in that order:

$$pppqq = p^3 q^2 = \left(\tfrac{1}{2}\right)^3 \left(\tfrac{1}{2}\right)^2 = \frac{1}{32}$$

where p = probability of guessing right
      q = probability of guessing wrong

Next, we determine the number of ways in which three rights (R) and
two wrongs (W) could occur on five trials. By listing, we discover a
total of 10 ways:

RRRWW WRWRR
RRWRW RRWWR
RWRRW RWWRR
WRRRW RWRWR
WRRWR WWRRR

Combining the two results, we arrive at a probability of 10/32 for the
specified joint outcome, 3 rights and 2 wrongs.
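The listing of the ten orders, and the weighting that follows from it, can be sketched by enumerating all answer patterns. This assumes pure guessing with p = q = 1/2; the variable names are ours.

```python
from fractions import Fraction
from itertools import product

p = q = Fraction(1, 2)

# All 2**5 = 32 possible patterns of Right/Wrong on five items.
patterns = ["".join(s) for s in product("RW", repeat=5)]
three_right = [s for s in patterns if s.count("R") == 3]

pr_one_order = p**3 * q**2                    # e.g. RRRWW in exactly that order
pr_three_right = len(three_right) * pr_one_order

print(len(patterns), len(three_right))        # 32 10
print(pr_one_order, pr_three_right)           # 1/32 5/16
```

The value 5/16 is the reduced form of 10/32.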

Full Set of Binomial Probabilities. By identical logic, we could obtain
the probabilities of 2 and 4 rights, respectively. The probability of all
5 right, by guessing, would be 1/32, since there is only one way of obtaining
5 successes on five trials. Likewise, the probability of all wrong on five


Section Four 

Binomial Probabilities: 

Independent Events 

The Binomial. Every possibility set obviously may be reduced to two
mutually exclusive possibilities, and the resulting outcomes labeled success
and failure. The outcome termed success may be an elementary event,
such as a male birth, or it may be one of a group of elementary events,
such as four aces in a deck of playing cards. In these instances, the proba-
bilities of success would be 1/2 and 4/52, respectively. But however defined,
whenever the possibility set consists of only two alternatives, there can
be correspondingly only two probabilities: the probability of success,
p, and the probability of failure, q. Hence, the probabilities of the two-
fold set may always be written:

(p + q) = 1.00

Since the expression on the left is the algebraic sum of two terms, it is
a binomial.*

Now, while the layman's interest is almost always centered on the
probability of success on a single trial, the statistician's interest usually
lies in the probability of r successes on N trials. He is interested in the
general quantitative laws that govern the behavior of events on repeated
trials, and which enable him to predict the likelihood of their occurrence.
For example, he may desire the probability of two heads on successive
throws, or the probability of a 5 and 5 when two dice are thrown together.
We have already shown that the probability of a 5 and 5 is 1/36, since a
5 and 5 may occur in only one way. On the other hand, the probability
of a 3 and a 5 is 2/36, since a 3 and 5 may occur in two ways. Analogously,
the probability that the faces of two dice will sum to 7 is 6/36, and the
probability that they will sum to 8 is 5/36.
Clearly, a specified outcome may occur in more than one way. Hence,
in order to calculate the probability of that outcome, we must always
determine the number of ways in which it can occur; otherwise we can-
not properly weight its probability. As we have done before, we pursue
a very simple example: the probability of parents having a boy and a
girl (ignoring multiple births). We first calculate the probability of a
male birth and a female birth in that order. This is, according to the
product theorem for independent events:

* In the analysis that follows, outcomes are assumed to be independent from trial to
trial; that is, the probabilities of success and failure are constant from trial to
trial.


This probability series naturally sums to unity, since every possible
joint outcome is accommodated herein.

Binomial Probability Distribution. Since probability is identical with
relative frequency, the binomial probabilities may be viewed as the
frequency distribution of the variable "r successes on N trials," r taking
all integral values from zero through N. Such a distribution is appropri-
ately entitled a binomial probability distribution, and may be displayed in
either tabular or graphic form. Thus, where p = q = 1/2, and N = 5,
as in the foregoing example, the binomial probability table would be
constituted as shown in Table 8.4.2. The corresponding histogram is
shown in Figure 8.4.1.


Table 8.4.2

Binomial Frequency Table, N = 5, p = .5

  r      Pr(r)

  5      1/32
  4      5/32
  3     10/32
  2     10/32
  1      5/32
  0      1/32

         1.00


Mean and SD of Binomial Distribution. By definition, the mean of a 
binomial distribution is the average number of successes on all trials; 
it is therefore the expected number per trial. The SD is, of course, an 
average of chance variation around that mean expectation. Both quan- 


Figure 8.4.1  Histogram, Binomial Probability Distribution (N = 5, p = .5). Horizontal axis: successes. 




Probability 


trials would be 1/32, there being only one possible way of getting every 
item wrong. All of these results may be usefully assembled in a single 
display which suggests the law of their formation (Table 8.4.1). 


Table 8.4.1  Combinations of Rights and Wrongs, Five True-False Items 

                              Number Right 
                  5       4       3       2       1       0     Total 

                RRRRR   RRRRW   RRRWW   WWWRR   WWWWR   WWWWW 
                        RRRWR   RRWWR   WWRRW   WWWRW 
                        RRWRR   RWWRR   WRRWW   WWRWW 
                        RWRRR   WWRRR   RRWWW   WRWWW 
                        WRRRR   WRWRR   RWRWW   RWWWW 
                                WRRWR   RWWRW 
                                WRRRW   RWWWR 
                                RWRWR   WRWRW 
                                RWRRW   WRWWR 
                                RRWRW   WWRWR 

Combinations      1       5      10      10       5       1       32 
Probability     1/32    5/32   10/32   10/32    5/32    1/32       1 

From an inspection of this complete set of binomial probabilities, we 
may discern the general rule: to find the probability of r successes in 
N independent trials, find the probability of r successes and N − r failures 
in that order, and then weight that probability by the number of ways 
in which r successes can occur. But, according to the rule on combinations, 
the drawing of r successes from N trials is simply $C^N_r$. Hence, 
the probability of r successes on N trials is: 

$$\Pr(r \text{ successes}) = C^N_r \, p^r q^{N-r} \qquad (8.4.1)$$
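Formula 8.4.1 translates directly into a few lines of code. The following Python sketch is illustrative only; the function name `binomial_probability` is an assumption, and exact fractions are used so that the results can be compared with the thirty-seconds in the tables.

```python
from fractions import Fraction
from math import comb

def binomial_probability(r, n, p):
    """Probability of exactly r successes on n independent trials (8.4.1)."""
    q = 1 - p
    # comb(n, r) is the number of ways r successes can occur among n trials.
    return comb(n, r) * p**r * q**(n - r)

half = Fraction(1, 2)
# Five True-False items answered by guessing: r = 5, 4, ..., 0.
distribution = [binomial_probability(r, 5, half) for r in range(5, -1, -1)]

print(distribution)
print(sum(distribution))
```

The same function serves for unequal probabilities, for instance p = .05 and q = .95, since only the weighting of each arrangement changes and not the count of arrangements.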

The Expanded Binomial. Were we to express each probability in the 
above notation and arrange them in order from five successes to none 
at all, we would have the following sequence: 

$$C^5_5 p^5 q^0,\; C^5_4 p^4 q^1,\; C^5_3 p^3 q^2,\; C^5_2 p^2 q^3,\; C^5_1 p^1 q^4,\; C^5_0 p^0 q^5$$

And, were we to sum these terms, we would have an instance of the 
expanded binomial, since the algebraic results correspond to raising the 
binomial (p + q) to the Nth power, $(p + q)^N$; in our example, $(\tfrac{1}{2} + \tfrac{1}{2})^5$: 

$$\tfrac{1}{32} + \tfrac{5}{32} + \tfrac{10}{32} + \tfrac{10}{32} + \tfrac{5}{32} + \tfrac{1}{32} = 1$$




Number of 
Failures        Pr 

   10           1(.05)^10 (.95)^0 
    9          10(.05)^9 (.95)^1 
    8          45(.05)^8 (.95)^2 
    7         120(.05)^7 (.95)^3 
    6         210(.05)^6 (.95)^4 
    5         252(.05)^5 (.95)^5 
    4         210(.05)^4 (.95)^6 
    3         120(.05)^3 (.95)^7 
    2          45(.05)^2 (.95)^8 
    1          10(.05)^1 (.95)^9 
    0           1(.05)^0 (.95)^10 


flunk on each trial is 5 per cent — or, in other words, the probability 
of any given student failing is 5 per cent. Nevertheless, in an extreme 
case, the whole class of 10 may conceivably flunk — but not very often — 
and still be consistent with the theoretical average of 5 per cent failures. 
The probability that the whole class would fail is represented by the 

first term of the expansion: $(.05)^{10} = \frac{1}{10{,}240{,}000{,}000{,}000}$; 
this is an extremely improbable event. However, a freak event does 
occasionally occur, however miraculous it may seem when it happens 
to you. This particular event is only 15 times as unlikely as a specific 
combination of cards in any given hand of bridge. 
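The terms of this expansion are tedious to evaluate by hand but easy to compute. The sketch below, in Python, evaluates each term for a class of 10 with a 5 per cent failure rate; the variable names are illustrative assumptions, and exact fractions are used so that the first term can be compared with the figure quoted above.

```python
from fractions import Fraction
from math import comb

p_fail = Fraction(5, 100)   # long-run failure rate taken from the text
p_pass = 1 - p_fail
class_size = 10

# Pr(r failures) for r = 10, 9, ..., 0, following formula 8.4.1.
terms = {r: comb(class_size, r) * p_fail**r * p_pass**(class_size - r)
         for r in range(class_size, -1, -1)}

print(terms[10])            # the whole class fails
print(float(terms[0]))      # nobody fails
print(sum(terms.values()))  # the terms sum to unity
```

The distribution is markedly asymmetrical: most of the probability is concentrated on zero or one failure, while the terms for many failures are vanishingly small.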

Binomial and Normal Distributions. By means of the binomial expansion 
it is a simple matter to determine, for example, the probabilities 
of 3, 2, 1, and 0 boys in families of three children. However, this method 
of calculating probabilities becomes cumbersome when the number of 
trials is large — for example, when we wish to calculate the probability 
of 90 or more heads in 100 throws or of 60 or more rights by guessing 
on a True-False test of 100 items. Confronted by such problems, early 
statisticians posed the question of whether the binomial distribution ap- 
proaches some fixed pattern as N gets larger, which in turn might be 
used to provide approximately the desired probabilities. 

This interesting and fruitful possibility was explored and resolved by 
De Moivre over 200 years ago in his discovery that the binomial distribution 
more and more nearly resembles the normal curve as N increases 
without limit. In fact, N has only to exceed 20 in order to produce a 
very good fit between binomial and normal distributions, provided that 
p is not too divergent from 1/2. This convergence of the discrete binomial 



tities may be directly computed in the usual manner. Thus, to find the 
mean, we (1) weight each value by its corresponding probability (fre- 
quency); (2) sum these weighted values; and (3) divide that sum by 
the total probability (frequency). Since the sum of the probabilities is 
always unity, the mean of the binomial probability distribution is simply 
the sum of the values weighted by their respective probabilities. The 
SD is analogously computed. In Table 8.4.3, we illustrate the computation 
of the mean of the binomial frequency distribution of Table 8.4.2. 


Table 8.4.3  Computation of Mean, Binomial Frequency Distribution 

    r       Pr(r)     Pr(r) × r 
    5        1/32        5/32 
    4        5/32       20/32 
    3       10/32       30/32 
    2       10/32       20/32 
    1        5/32        5/32 
    0        1/32        0 
            -----      ------ 
             1.00      X̄ = 2.5 








Practically, such cumbersome calculations are unnecessary, since the 
mean of the binomial distribution of r successes on N trials is always 
p × N, usually written Np, and the SD is $\sqrt{p \times q \times N}$, usually written $\sqrt{Npq}$. 
Applying these formulas to the above data we obtain: 

$$\bar{X} = Np \qquad (8.4.2)$$

$$\bar{X} = 5(\tfrac{1}{2}) = 2.5$$

$$\sigma = \sqrt{Npq} \qquad (8.4.3)$$

$$\sigma = \sqrt{5(\tfrac{1}{2})(\tfrac{1}{2})} = \sqrt{1.25} = 1.1$$
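That the shortcut formulas agree with the long method of weighting each value by its probability can be verified numerically. The following Python sketch is a check under the text's example of N = 5 and p = 1/2; the variable names are illustrative assumptions.

```python
from fractions import Fraction
from math import comb, sqrt

n, p = 5, Fraction(1, 2)
q = 1 - p

# Binomial probabilities Pr(r) for r = 0, 1, ..., n.
pr = {r: comb(n, r) * p**r * q**(n - r) for r in range(n + 1)}

# Long method: weight each value (and each squared deviation) by Pr(r).
mean_weighted = sum(r * pr[r] for r in pr)
variance_weighted = sum((r - mean_weighted) ** 2 * pr[r] for r in pr)

# Shortcut formulas 8.4.2 and 8.4.3 for comparison.
print(mean_weighted, n * p)
print(sqrt(variance_weighted), sqrt(n * p * q))
```

Both routes give a mean of 2.5 and an SD that rounds to 1.1, so the weighted computation of Table 8.4.3 is needed only as a demonstration.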

Empirical Probabilities. The binomial expansion need not result in a 
symmetrical distribution, nor need it refer only to a priori probabilities. 
If, for instance, over a period of time, 5 per cent of the freshmen in English I 
fail to pass, 95 per cent would succeed in passing. Presumably, 
future students would conform to those probabilities. The corresponding 
binomial would therefore be: $(.05 + .95)^{10} = 1$. Accordingly, we 
may ask illustratively with what frequency the various combinations 
of successes and failures would occur by chance in the long run in classes 
of 10 students. The solution is provided by the successive terms of the 
expanded binomial shown in Table 8.4.4. In this situation, a successful 

Areas. We need only to express an observed r-value as a standard deviate, 
and then to evaluate that standard deviate by means of this table. 
Thus, to find the probability of guessing 60 items or more right on a 
True-False test of 100 items, we first express 60 as a standard deviate 
and then determine the frequency with which more exceptional values 
are expected to occur. To convert 60 to standard form, we require of 
course the mean and SD of the distribution: 

$$\bar{X} = Np = 100(\tfrac{1}{2}) = 50 \qquad \sigma = \sqrt{Npq} = \sqrt{100(\tfrac{1}{2})(\tfrac{1}{2})} = \sqrt{25} = 5$$

By means of these values, we convert 60 to a standard deviate in the 
usual manner, with one minor adjustment. Since the normal curve is 
based on a continuous variable, we treat discrete 60 as the midpoint of 
an interval extending from 59.5 to 60.5. Since our problem is to determine 
the probability of 60 or more right by guessing, rather than more 
than 60, our standard deviate is calculated on 59.5 instead of 60.5. 

$$\frac{r - Np}{\sqrt{Npq}} = \frac{59.5 - 50}{5} = 1.9$$

From the table we find that approximately 3 per cent of all items in a 
normal distribution lie beyond 1.9 sigmas; hence, the probability of 
obtaining 60 or more by guessing is approximately 3 per cent. In other 
words, in a class of guessers, 3 per cent could expect to score 60 or higher. 
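The quality of the normal approximation in this example can be judged by setting it beside the exact binomial tail. The Python sketch below is illustrative; it substitutes the standard library's complementary error function for the printed Table of Normal Areas, which is an assumption of convenience rather than the text's procedure.

```python
from math import comb, erfc, sqrt

n, p, r = 100, 0.5, 60
mean = n * p
sd = sqrt(n * p * (1 - p))

# Continuity correction: "60 or more" begins at the lower limit 59.5.
z = (r - 0.5 - mean) / sd

# Area of the standard normal curve lying beyond z (upper tail).
normal_tail = 0.5 * erfc(z / sqrt(2))

# Exact binomial tail for p = 1/2: favorable arrangements over 2**n.
exact_tail = sum(comb(n, k) for k in range(r, n + 1)) / 2**n

print(round(z, 1), round(normal_tail, 3), round(exact_tail, 3))
```

Both figures come to roughly 3 per cent, which illustrates why the table of normal areas is an acceptable substitute for the binomial expansion when N is large and p is near 1/2.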

Although 3 per cent of a large class of students could be expected to 
answer correctly 60 or more out of 100 True-False questions by sheer 
chance, we cannot infer from such an observed score that they actually 
did guess. An obtained score of 60, for example, could simply mean that 
the student controlled 60 per cent of the subject matter. How, then, 
is the professor, or any other observer, to determine the correct explana- 
tion of such a score? The answer is: on the basis of the statistical evi- 
dence, he simply cannot know for sure. The central issue of what is 
known in statistical interpretation as decision-making is now joined: 
whether to accept the hypothesis of chance (also known as the null 
hypothesis), or to explain the score of 60 by the presence of some determining 
factor such as studiously acquired knowledge or possibly even cheating. 
There is no formula for adjudicating between these two alternatives, 
although as the probability of the chance event becomes smaller and 
smaller, the chance hypothesis of guessing becomes correspondingly less 
and less acceptable. Such questions will be treated in Chapter 12 on 
statistical inference. 


Figure 8.4.2a  Histogram, Probability Distribution (N = 5, p = .5) 

on the continuous normal distribution is illustrated in Figures 8.4.2a and 
8.4.2b, where p = 1/2 and N = 5 and 10, respectively. 

The practical significance of this equivalence between the binomial 
and normal probabilities inheres in the fact that the binomial probabilities 
may be handily obtained from the usually accessible Table of Normal 



Figure 8.4.2b  Histogram, Probability Distribution (N = 10, p = .5) 







Questions and Problems 

1. Define the following concepts: 

Binomial 
Binomial Probabilities 
Expanded Binomial 
Binomial Probability Distribution 
Independent Outcome 

2. It has been contended that the excess of males over females at birth in the 
United States is due to the tendency of families to have no children after a male 
birth. Would such a parental bias affect the sex ratio? 

3. A person claims he can discriminate between the tastes of two cigarettes. He 
is given Brands A and B in pairs, 5 times. Assuming he merely guesses, calculate 
the probability of 5 correct choices; 4 correct choices; at least 4 correct 
choices. What chance factors would interfere with his judgment? How many 
rights would you require to be convinced that he has the power to discriminate? 

4. Are the following probabilities identical: 5 heads in 5 tosses of 1 coin; 5 heads 
in 1 toss of 5 coins? State the rule. 

5. In tossing a coin, how long a run of heads could one throw without arousing 
suspicion of a biased coin? Explain your reasoning. 

6. In a throw of two coins, what is the probability of obtaining exactly 50 per cent 
heads and 50 per cent tails? Calculate also for 4, 6, 8, and 10 coins. What 
happens to the probabilities of an even split as the number of coins is increased 
from two to ten? 

7. If a True-False examination consists of six questions, what is the probability 
that the student will mark by chance: 

(a) all correctly? 

(b) half of them correctly? 

(c) all incorrectly? 

8. Does the expanded binomial always result in a symmetrical distribution? 
Calculate the distribution of fives and no-fives with 2, 4, and 5 dice. What 
generalization can you offer? (Hint: Prepare a histogram.) 

9. The Old Testament states that Jacob had 12 sons. We may hypothesize either 
(a) that there were actually no daughters among the children, or (b) that the 
daughters were not recorded. Determine the probability of the first alternative, 
and state which assumption you favor. What other factors, besides 
statistical evidence, might enter into your decision? 






§ 9.1 Concept of Statistical Association 

Although no event is conceivable in isolation, it is not a simple matter 
to detect with what variables it is linked. For example, the association 
between delinquency and intelligence was once thought to be very close, 
but in later analysis was found to be quite remote. This shift in inter- 
pretation was a result not only of more carefully controlled studies of 
delinquency and its associated social factors, but also of a redefinition 
of the test-intelligence norms themselves. 

To man, nature is infinitely complex; hence, the search for such underlying 
relations can never be completed. There are innumerable factors 
whose relations are so intricate that the “final” factor in the production 
of an event can never be attained, much less measured. However, in 
its unfulfilled quest for certainty, our common sense begins very early 
to construct patterns of relationships on the basis of which it endeavors 
to comprehend the past, understand the present, and thereby anticipate 
the future. 

Patterns of Relationship. The patterns of association are, of course, 
continuously revised in the trial and error of daily experience. In the 
process, the observer mentally quantifies and summarizes his observations 
by first noting the factors which seem to him to “cause” or to be 
linked with the event and, second, by noting the frequency with which 
he successfully anticipates or forecasts it. In this casual manner, every 
person begins to cultivate the habits of association and becomes an informal 
statistician. He practices intuitively the principle of correlation. 
In fact, much sociology effectively employs such intuitive statistics, 
skillfully put together by alert, widely-traveled scholars, made without 
benefit of pencil and paper calculation or technical procedures. Indeed, 
many who are critical of the utility of statistical procedures nevertheless 
unwittingly employ them in this unofficial manner. 

For some purposes, a rough subjective approximation of a correlation 
is fairly satisfactory, but for scientific purposes more accurate measurements 
are desired. Such precision is not a simple matter to achieve. 
The difficulty lies in the complexity of the patterns of relationship in 
terms of which we view and organize the world. Some of the salient 
features of this complexity in patterns may be formulated as follows: 
(1) every event is the outcome of multiple factors; (2) the force of these 
respective factors varies in intensity, and (3) may flow in one or more 
directions in producing its effects; (4) the factors are in constant interaction, 
and consequently (5) they may reinforce, counteract, and cancel 
one another. We illustrate with a popular example: the halfback in a 
football game, whose scoring power tends to be associated positively with 
running ability, is either obstructed or aided in actual performance by 
the condition of the field, his fatigue, the type of plays called, to say 
nothing of the varied activities of the other twenty-one players on the 




Measurement of Association: 
Qualitative Variables 



Section One 

Concept of Statistical Association 


Principle of Contingency. It is an axiom of science, as well as of 
common sense, that no event in nature “just happens,” but always occurs 
under very specific, known or unknown, circumstances. An event is 
therefore never to be viewed in isolation. It must be considered as a 
product of the joint operation of many forces, each of which contributes 
a variable element to the observed outcome. Thus, the size of family 
may be dependent on such factors as age at marriage, level of income, 
extent of employment of the mother outside the home, and the religious 
ideology entertained by the parents. Parole success of released prisoners 
may be related to the type of crime committed, age, and history of recidivism. 
Some of these factors tend to promote and accelerate, others 
to retard in varying degrees or even inhibit, the occurrence of the event, 
or at least to modify its character or magnitude. Thus, the religious 
factor may foster large families, while the economic factor may restrain 
that tendency. In any case, we cannot predict the degree of parole success, 
or explain the size of the family, unconditionally, but only on the 
basis of specifically designated factors, or variables, on which the outcome 
is contingent. The human observer does not possess absolute 
knowledge; he must use one event as a cue to anticipate another. His 
understanding is therefore grounded in the principle of contingency. And 
it is in accordance with that principle that the sciences, both physical 
and social, have set up their methods and objectives: (1) to identify the 
variables (determining factors) associated with an event; (2) to uncover 
the patterns of association between the variables and the event; and 
(3) to measure the strength of that association. 




Association of Qualitative Variables 


field. This association between scoring power and running ability may 
therefore not be visible to the fans in the stadium. Just so is the correlation 
between income and size of family beclouded by such factors as 
religion, age at marriage, occupation of breadwinner, and education. 
It may at first seem a hopeless task to disentangle these networks of 
relationships and to subject them to statistical reasoning. Nevertheless, 
in a large number of observations, the essential relation can be expected 
to shine through the haze formed by the multiplicity of factors. 

This problem is probably less difficult in the physical than in the social 
sciences. The physical scientist, by means of available laboratory controls, 
is to a certain extent able to segregate and manipulate his elements 
and to replicate his careful and undisturbed observations, whereas the 
social scientist is often obliged to accept data which are like unrefined 
ores from “nature in the raw” and to assemble materials from widely 
dispersed sources and a variety of settings. He is therefore compelled 
to employ statistical controls, since, in general, laboratory controls are 
closed to him. 

What Are Evidences of Relationship? How can we be sure that factors 
are interconnected? And, having discovered a relationship, how may we determine 
the degree or intensity of that relation? Broadly speaking, there are 
two earmarks of such linkage: (1) joint occurrence of attributes, and 
(2) parallel changes in two or more series of quantitative observations. 

The relative frequency with which certain attributes happen together 
is probably the most elementary basis of any judgment of association. 
This is the principle of joint occurrence. Statistical variables, like human 
beings, are usually judged "by the company they keep.” For example, 
if delinquency is more often found in boys than in girls, we conclude that 
delinquency is associated with “boyness.” The strength of this association 
will vary according to other factors such as the boy’s age, the type 
of delinquency, and many other elements in the pattern, all of which will 
render the statistical application of an apparently simple principle quite 
complicated. Hence, it need hardly be reiterated here that some system 
of tabulation and classification is necessary as an aid not only in establishing 
an association, but also in determining its strength. 

Second, if in two series of quantitative data, a unit change in one 
variable is paralleled with some degree of regularity by a comparable 
change in the other series (that is, if they move together), we conclude 
that they are somehow tied together, and that there is an association 
between the two sets of data. For example, as income declines, the size 
of family tends to increase; and, if the observations endure through a 
rather extensive range (that is, for the entire range of families of all 
sizes and of incomes of varying amounts), the evidence of a relation is 
strengthened. This is called the principle of covariation. 




§ 9.2 Yule's Q 


But the important point is that delinquencies are distributed between 
boys and girls, and that auto accidents are shared by both men and 
women drivers. We must therefore inquire in what proportion the 
limited supply of violations or auto accidents are divided between the 
sexes. To which sex do delinquency or accidents preferentially attach 
themselves? To estimate the degree of affinity for either sex, we must 
know the delinquency rate for both sexes; a delinquency rate for boys 
can be said to be high only when the rate for the girls is known. Let 
us therefore provide the rates for females and Democrats in the 2 × 1 
tables, and analyze the results. 


Table 9.2.2  2 × 2 Tables 

                      Boys     Girls 
Delinquent             10%       0% 
Non-delinquent         90      100 
                      100%     100% 

                      Rep.     Dem. 
Isolationist           50%      30% 
Internationalist       50       70 
                      100%     100% 

                      Male     Female 
A Grades               10%      20% 
Other                  90       80 
                      100%     100% 

                      Men      Women 
Accident               10%      20% 
No Accident            90       80 
                      100%     100% 

Source: Hypothetical 


The delinquency rate of boys now turns out to be rather high, since 
even a 10 per cent delinquency rate is higher than no delinquency at all. 
Similarly, the grades of college men are below the norm; the Republi- 
cans display a marked propensity toward isolationism; and women driv- 
ers show a marked susceptibility to mishaps. 

The fact is that we have now introduced a standard of judgment against 
which the 2X1 table may be compared. Some standard is inevitably 
and unwittingly introduced by every observer, and it is the function of 
statistical procedures to make the standard explicit. Hence, a 2 X 2 
table is the minimum for a dependable conclusion on association. 

The 2 × 2 Table. A 2 × 1 table merely presents the two subclasses of 
a single variable, whereas a 2 × 2 table presents the subclasses of two 
variables cross-classified by one another. In Table 9.2.3, a case is classified 
as boy and delinquent, or as girl and non-delinquent. This double 
classification automatically establishes the distribution of joint occurrences 
between a given sex and behavior which is, of course, basic to 
the understanding of the phenomenon of association. In the hypo- 



Section Two 

Coefficient of Association: Yule's Q 

One of the simplest measures of association is known as the Coefficient 
of Association, or, more informally, as Yule's Q. This measure is designed 
to reflect the degree of association between a pair of qualitative variables, 
arranged in a 2 × 2 (fourfold) table. In his study of cross-classification 
the student has already acquired the sense of inferring association, and 
even cause-and-effect relationships, between dichotomous variables. But 
at that juncture, we did not seek to compute a single over-all measure 
which would reflect the strength of that relation. Yule's Q and other 
measures of association are designed to do just that, since they are summarizing 
measures for bivariate data, analogous to the mean for univariate 
data. 

Inadequacy of the 2 × 1 Table for Estimate of Association. Many persons 
untrained in quantitative reasoning naively succumb to the temptation 
of drawing conclusions on the degree of association from a 2 × 1 table, 
instead of from a 2 × 2 table, which is the minimum for that purpose. 
The deceptive ease with which such erroneous deductions can be made 
is illustrated in the four 2 × 1 tables depicted in Table 9.2.1. 

Table 9.2.1  2 × 1 Tables 

Boys                 Per Cent 
Delinquent             10% 
Non-delinquent         90 
                      100% 

Republicans          Per Cent 
Isolationist           50% 
Internationalist       50 
                      100% 

Males                Per Cent 
A Grades               10% 
Other                  90 
                      100% 

Female Drivers       Per Cent 
Accident               20% 
No Accident            80 
                      100% 

Source: Hypothetical 


Since only 10 boys out of 100 are delinquent, a layman may be misled 
into the conclusion that there is a very weak association between boyness 
and delinquency; similarly, that women are not accident prone. Since 
Republicans divide 50-50 on isolation, there would seem to be no striking 
tendency for Republicans to favor one or the other foreign policy. 



sies in reflecting gradations of association. Yule's Q, being our first 
example of a measure of association, in its simplicity supplies us with a 
convenient introduction to this type of analysis which will, in its turn, 
throw considerable light on all subsequent instances. How sensitive, 
then, is Q to the variations in the data? 

First let us assume that not all the delinquents are boys, but rather 
that 1 and 5 girls in 50, respectively, are delinquents. (See a and b of 
Table 9.2.4.) In both instances, there is still a positive association be- 

Table 9.2.4  Sensitivity of Q 

[Fourfold tables (a) and (b), delinquency by sex, each with its value of Q; column totals 50 boys and 50 girls, N = 100.] 


tween delinquency and boyness, but not as complete as in the first table. 
Thus, the value of the index is reduced; it reaches zero when a delinquent 
is just as likely to be a boy as a girl. Sex and delinquency are then 
said to be independent; pure chance prevails. Statistically speaking, 
the internal cell ratios are identical with the corresponding marginal 
ratios. You might as well toss a coin for your prediction; hence, Q = 0. 

Effect of Marginal Ratios. We have seen how Q reflects the changes in 
row (column) ratios, with marginals remaining constant. However, Q 
is not at all sensitive to changes in marginal ratios, so long as the corresponding 
cell ratio remains fixed. For purposes in hand, let us construe 
the column subtotals as samples of 100 boys and 50 girls (Table 9.2.5). 

Table 9.2.5  Stability of Q 

                   (a)                   (b)                   (c) 
              B     G   Total       B     G   Total       B     G   Total 
Del.         10     3     13       10     6     16       20     6     26 
Non-del.     90    47    137       90    94    184       90    47    137 
Total       100    50    150      100   100    200      110    53    163 

              Q = .27               Q = .27               Q = .27 




Table 9.2.3  Number of Delinquents by Sex 

                    Boy    Girl    Total 
Delinquent           20      0       20 
Non-delinquent       30     50       80 
Total                50     50      100 


thetical group of 100 children shown in Table 9.2.3, there are 20 delinquents, 
all of whom are boys. Most of the children are, of course, non-delinquent. 
But, given a delinquency, there is perfect prediction; 
that is, complete certainty about the sex of the delinquent. Hence, 
taking the data at face value, we may say that delinquency is completely 
explained by boyness, since there is obviously no element of girlness required 
for its occurrence. The completeness of this explanation should 
register itself in any index that may be contrived for that purpose. 

In order to provide a single measure of association for such a 2 × 2 
table, the English statistician, G. Udny Yule, proposed the following 
coefficient of association, which he labeled "Q” in honor of the nineteenth 
century statistician, Quetelet: 

$$Q = \frac{ad - bc}{ad + bc} \qquad (9.2.1)$$

where a, b, c, d are the joint frequencies conventionally arranged in the 
fourfold table in the following manner: 

        a    b 
        c    d 

Applying the formula to Table 9.2.3: 

$$Q = \frac{(20 \times 50) - (30 \times 0)}{(20 \times 50) + (30 \times 0)} = \frac{1000 - 0}{1000 + 0} = 1.00$$

This index of unity, a result already anticipated, is the obvious 
measure of the complete association between maleness and delinquency, 
since all delinquents are male, and all females are necessarily non-delinquent. 


Sensitivity of Q. Any index must be able to discriminate to some extent 
between slight variations in the data. It will later become apparent 
that each of the various measures of association has its own idiosyncra- 


Table 9.2.6  Observed and Chance Frequencies 

                  Observed          Expected 
                  B      G          B      G 
Del.             15      5         10     10 
Non-del.         35     45         40     40 


the corresponding deficiency. There is therefore a positive affinity between 
boys and delinquency. However, since the excess of 15 over 10 
is fairly moderate, the index reveals only a moderate association: Q = .59. 
But it is the amount of this excess that constitutes, by definition, the 
degree of association. This difference between the observed and expected 
frequencies is conventionally symbolized as (O − E), an expression 
that has achieved an almost epigrammatic currency in statistical 
methods of investigation, which the student will learn to appreciate. 
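The chance frequencies and the differences (O − E) can be derived from the marginals of any such table. The following Python sketch is illustrative; the function name `expected_frequencies` is an assumption, and the observed cells are those of the example with 15 delinquent boys and 5 delinquent girls.

```python
def expected_frequencies(table):
    """Chance frequencies for a table given as a list of rows:
    each cell is (row total x column total) / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand = sum(row_totals)
    return [[rt * ct / grand for ct in col_totals] for rt in row_totals]

observed = [[15, 5],    # delinquent: boys, girls
            [35, 45]]   # non-delinquent: boys, girls
expected = expected_frequencies(observed)

# Observed minus expected, cell by cell.
differences = [[o - e for o, e in zip(obs_row, exp_row)]
               for obs_row, exp_row in zip(observed, expected)]

print(expected)
print(differences)
```

In a fourfold table the four differences are equal in size and alternate in sign, so the excess in one cell fixes the deficiency in its neighbors.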

Signs. Although mathematically the formula for Q will yield a sign, it 
may not seem very meaningful when applied to attributes such as sex, 
religion, or race, which do not constitute a hierarchy. In the foregoing 
illustration, therefore, one may say that maleness is positively associated 
with delinquency, but it cannot be verbalized as “the higher the sex, the 
higher the delinquency rate.” Nevertheless, we may be guided by the 
algebraic sign which always refers to the association indicated in cells 
a and d. When the sign is positive, the joint frequencies of the attributes 
intersecting in cells a and d exceed chance (and are therefore linked 
together); when the sign is negative, the frequencies in cells a and d 
are less than would be expected by chance, and therefore “repel” each 
other. By corollary, cells b and c are read in the opposite manner. The 
assumption is, of course, that the alphabetical designations of cells are 
made in the conventional order. 

Type of Data to Which Q Is Applicable. While Q is singularly appropriate 
for genuine attributes, nevertheless this measure may at times be applied 
to continuous variables by prudently cutting them into dichotomies. 
For example, persons may be subclassified as under and over 21 years, 
and responses may be either positive or negative, although these latter 
usually shade into each other (see Table 9.2.7). 

There are, however, two cautions which should always be observed 


One might intuitively suppose that the observed association of Q = .27 
may be due in part to the fact that there are twice as many boys as girls 
in the total sample of 150, and that by doubling the sample of girls the 
index might increase. Let us, therefore, double the number of girls, 
retaining, however, the same rate of delinquency among the girls. It 
will be observed that the value of Q remains unchanged under this inflation. 
Similarly, if we double the supply of delinquents from 13 to 26 
by multiplying the frequencies of that row by 2, the value of Q again 
remains unchanged. 

In general, therefore, a change in the relative size of the marginals does 
not affect the value of Q, so long as the ratios within either columns or 
rows remain undisturbed. As we shall see later, this does not hold for 
φ, another index of association for fourfold tables. In this respect Q is 
more stable than is φ. Whether this type of insensitivity is desirable 
(and should therefore be considered a virtue of stability) or whether it 
is a defect of the formula must be left to the good judgment of the worker 
in terms of his purposes. It is doubtless important to know that the 
number of observations in a subclass or stratum does not, of itself, influence 
the size of Q. 
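The stability of Q under inflation of a column or a row can be demonstrated numerically. In the Python sketch below, the cell counts are an assumption reconstructed from the description in the text (13 delinquents among 100 boys and 50 girls, with Q = .27), and the function name `yules_q` is illustrative.

```python
def yules_q(a, b, c, d):
    """Yule's Q for a fourfold table with rows (a, b) and (c, d)."""
    return (a * d - b * c) / (a * d + b * c)

# Cells in the order: delinquent boys, delinquent girls,
# non-delinquent boys, non-delinquent girls (assumed counts).
base = (10, 3, 90, 47)                 # 100 boys, 50 girls
girls_doubled = (10, 6, 90, 94)        # girls' column multiplied by 2
delinquents_doubled = (20, 6, 90, 47)  # delinquent row multiplied by 2

for cells in (base, girls_doubled, delinquents_doubled):
    print(cells, round(yules_q(*cells), 2))
```

Multiplying a whole row or column by a constant multiplies both cross-products ad and bc by that constant, so the constant cancels from the numerator and denominator of Q.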

Definition of Association: Observed and Chance Frequencies. The above
illustrative tables yielded, among other things, the important generalization
that, when the internal cell ratios and corresponding marginal ratios
coincide, and there is no association, the index must be zero. Only when
these two sets of ratios differ, and to the extent of that difference, is
there statistical association. Indeed, the formula for Q rests on this
principle of the discrepancy between the observed and chance cell
frequencies.

The chance frequencies are easily derivable from the marginal frequencies
according to the following logic: the boys constitute 50 per cent of
the children; hence, if the two sexes are equally susceptible to delinquency
(or, in other words, if the sex factor has no influence on the production
of delinquency; i.e., if sex and delinquency are independent),
the boys would also have 50 per cent of the delinquencies. To express
this in still another way, if delinquencies were divided impartially (by
chance) among the two sexes, the boys' quota would be 50 per cent, or
in terms of our example, the boys would have 10 of the 20 delinquencies.
Arithmetically put:

$$.50 \times 20 = 10$$
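
The same logic extends to all four cells: each chance frequency is the product of its row and column subtotals divided by the grand total. The sketch below is a minimal illustration; the function name is hypothetical, and the total of 100 children (hence 80 non-delinquents) is an assumption made for the example, since the passage states only the boys' 50 per cent share and the 20 delinquencies.

```python
def chance_frequencies(row_totals, col_totals):
    """Cell frequencies expected under independence, given the marginals."""
    n = sum(row_totals)
    if n != sum(col_totals):
        raise ValueError("row and column subtotals must have the same sum")
    return [[r * c / n for c in col_totals] for r in row_totals]

# Rows: delinquent, non-delinquent (assumed 20 and 80).
# Columns: boys, girls (50 per cent each of an assumed 100 children).
expected = chance_frequencies([20, 80], [50, 50])
print(expected[0][0])  # the boys' chance quota of delinquencies
```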


In essence, we simply adjust the cell ratios to be identical with the
corresponding marginal ratios (Table 9.2.6). The boys actually show 6 more
delinquencies than would be expected by chance, while the girls show



§ 9.2 Yule's Q


Table 9.2.8b Unlike Marginal Sets

competitive with other measures of association, in those situations where
the cases fall predominantly in three of the four cells. Such a distribution,
which is evidence of a one-way association, is possible only when
marginal sets are unlike. Hence, Q is a likely choice whenever marginal
sets are dissimilar. The reason for this will become clearer after φ and
other indexes of association have been studied.

Interpretation of Yule's Q. For all its apparent simplicity and precision,
Q possesses no simple quantitative meaning, and is not convertible into
a specific prediction.* For example, if the association between sex and
delinquency is .5, one cannot say that 50 per cent of a given sex is
delinquent, or that one sex is twice as delinquent as another, or that sex
accounts for 50 per cent of all delinquency. A Q-value of .8 is not twice
as strong as .4, although it is stronger than all values less than .8.

Not even is a coefficient of 1.00 wholly unambiguous. In the previous
example, the perfect Q did not depend on the tendency of boys to become
delinquent, but rather on the relative tendency that a delinquency be
committed by a boy. Actually, in absolute terms, the tendency of a
boy to be delinquent could be very weak, and Q still be 1.00, provided
that no girls are delinquent. A coefficient of 1.00 therefore registers the
presence of one-way association, but does not reflect the degree of
prediction in the other direction, although that too may be complete. We
may say, therefore, that Q is primarily a measure of one-way
association.
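
A small numerical sketch may make this ambiguity concrete. The counts are hypothetical and are not taken from any table in the text: only 2 of 50 boys are delinquent, yet Q is still 1.00 because no girl is delinquent.

```python
def yule_q(a, b, c, d):
    """Yule's Q for a fourfold table: a, b = top row; c, d = bottom row."""
    return (a * d - b * c) / (a * d + b * c)

# Hypothetical counts: rows = delinquent / non-delinquent, columns = boys / girls.
a, b = 2, 0      # delinquent boys, delinquent girls
c, d = 48, 50    # non-delinquent boys, non-delinquent girls

q = yule_q(a, b, c, d)
boys_rate = a / (a + c)   # absolute tendency of a boy to be delinquent

print(q, boys_rate)
```

Here Q registers only that every delinquency is committed by a boy; the boys' own rate of 4 per cent plays no part in the index.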

In general, it would seem to be sound practice to quote the original
2 X 2 tabulation along with the calculated value of Q. Not only is the
table necessary in order to identify linkages between specific attributes;
it also permits the reader to examine for himself the detailed structure
of the relationship. But if it is a single index that is required, then one
must accept the limitations of such a condensation, just as the mean
was accepted as a limited representative of the full array.

* A precise probabilistic interpretation of Q has recently been formulated. See
L. A. Goodman and W. H. Kruskal, “Measures of Association for Cross Classifications,”
Journal of the American Statistical Association, XLIX, 1954, p. 750.




Table 9.2.7 Opinion by Age

                 Opinion
Age           Yes      No     Total
               %        %       %
-21            60      40      100
21+            40      60      100


in such dichotomization. (1) Such compression of a mass of continuous,
detailed observations into two broad dichotomies may be an expensive
waste of perfectly good data, and naturally reduces the potential precision
of the resulting measure. Instead of cavalierly discarding costly
accuracy, it might be advisable to use other available measures which
can take such precision into account. (2) A less obvious trap for the
unwary operator is the more or less arbitrary location of the cutting
point, which introduces an unpredictable effect upon the ultimate index.
If, in the above table, the cutting point had been set at 30 years instead
of 21, the cell frequencies might have been radically changed, and the
Q-measure substantially altered. Such arbitrary decisions gravely reduce
the reliability and comparability of any measure, including Q. The
student will remember the general principle that all grouping has its
hazards; but when everything is staked on one cutting point, the
precariousness of the undertaking is increased.

The usefulness of Q also varies with the general pattern of distribution
of the data. These patterns may be particularized as follows:
(1) the division of frequencies within each marginal set; (2) the similarity
(dissimilarity) between the two marginal sets; and (3) the internal
distribution of cell frequencies. Thus, an individual marginal set may be
symmetrical (equal division) or skewed; the two marginal sets may be more
or less identical in degree of symmetry; and the cell frequencies may
be approximately evenly dispersed in all four cells, concentrated in three
cells, or in two diagonal cells (see Tables 9.2.8a and 9.2.8b).

It will become increasingly evident that Q is most useful, and least

Table 9.2.8a Like Marginal Sets




Questions and Problems 

1. Define the following concepts:

2 X 1 Table
2 X 2 Table
Statistical Association
Statistical Independence
One-Way Association
Two-Way Association
Expected Frequency
Observed Frequency
Marginal Distribution
Like Marginal Sets
Unlike Marginal Sets
Joint Frequency Distribution

2. In a study of mental patients, “94 per cent showed evidence of status conflict
before onset of mental illness.” (Source: Kingsley Davis, “Mental Hygiene
and Class Structure,” Psychiatry, I, 1938, pp. 55-65.)

(a) Does this observation demonstrate an association between status conflict
and mental illness? Explain your answer.

(b) What would be your conclusion if 94 per cent of the normal population also
had status conflict?

(c) If 50 per cent of the normal population had status conflict, what would
you conclude?

(d) Is a 2 X 1 table sufficient to prove a relationship?

3. Of the criminals in a given population, 80 per cent completed 8 years of
schooling or less. Of the non-criminals, 20 per cent had completed more than 8 years
of schooling. Prepare a 2 X 2 table and explain why Q = 0.

4. How can one know by mere inspection of a 2 X 2 table that Q is unity?

5. How is the sign of Q to be interpreted? Discuss with reference both to
attributes and to continuous data.

6. (a) Form a complete 2 X 2 percentage table based on the following data:
of a total of 650 young people, 80 per cent are boys, and 10 per cent are
delinquents; 10 per cent of the boys are delinquent.

(b) Convert the percentages to absolute frequencies.

7. (a) Construct a 2 X 2 table for the following data: 960 of 1,500 community
leaders replied “Yes” to the question “If a person wanted to make a speech
in your community against churches and religions, should he be allowed to
speak?” Of 897 non-leaders from the same communities, 350 answered
“Yes.” (Source: S. A. Stouffer, Communism, Conformity, and Civil
Liberties, Doubleday & Company, Inc., Garden City, N.Y., 1955, p. 33.)

(b) What conclusion on the relation between leadership and tolerance is
suggested by this table?

8. For Table 9.2.9:

(a) Convert percentages to frequencies, and compute Q.



(b) Convert cell frequencies to percentages of total (949), and compute Q.

(c) Compare answers. What principle in respect to Q is suggested?


Table 9.2.9 Soldiers' Answers to Question: “Is the Army giving you a good
chance to show what you can do?”

                     Response
Soldier Class     Yes      No      Total      N
Regulars          52%      48%     100%      300
Selectees         30%      70%     100%      649

Source: Samuel A. Stouffer et al., The American Soldier: Adjustment During
Army Life, Vol. I of Studies in Social Psychology in World War II, Princeton
University Press, Princeton, N.J., 1949, p. 73.


9. In comparing primitive tribes, a simple statistical technique is: (1) to tabulate
the number of traits that two tribes have in common; (2) the number present
in one but not in the other; and (3) the number of traits absent in both. Below
are three such tabulations (Table 9.2.10) based on selected Indian tribes in
California.

(a) Compute Q for each table.

(b) According to these results, which two tribes are culturally most similar?


Table 9.2.10 Common Culture Traits, Selected Indian Tribes, California

                     Karok
                   P        A      Total
…          P     709      142       851
           A     238      799     1,037
         Total   947      941     1,888

                     Karok
                   P        A      Total
…          P     527      345       872
           A     236      848     1,084
         Total   763    1,193     1,956

                     Hupa
                   P        A      Total
Chilula    P     650      286       936
           A      82      861       943
         Total   732    1,147     1,879


10. Study Table 9.2.11 and answer the following questions:

(a) If you are a smoker, what is the probability of developing lung cancer?

(b) If you have lung cancer, what is the probability of being a smoker?

(c) If you do not have lung cancer, what is the probability of being a smoker?

(d) What is the probability of anyone having lung cancer?

(e) Does the Q-value indicate that lung cancer is associated with smoking, or
that smoking leads to lung cancer?




Table 9.2.11 Smoking and Lung Cancer

                    Smokers    Non-Smokers    Total
Lung Cancer             9            1          10
No Lung Cancer         91          149         240
Total                 100          150         250

Source: Hypothetical


11. In Sun County, the voters and non-voters were tabulated by sex. The
association between male and voting as measured by Q was .32. In Rain County,
a similar tabulation led to a Q of .64. Given these Q-values, which of the
following inferences are valid? (Suggestion: prepare dummy table.)

(a) A male in Rain County is twice as likely to vote as a male in Sun County.

(b) The percentage of women who voted in Sun County is twice as great as
the percentage of women who voted in Rain County.

(c) In Rain County, a male is more likely to vote than a female.

(d) In Sun County, a female is more likely to vote than a male.

(e) What value would Q take if males and females were equally likely to vote?

(f) A male is more likely to vote than a female in both counties.


Section Three 


The Phi Coefficient


Comparison of φ and Q. The statistic φ (the Greek lower-case letter
phi) has certain resemblances to Q, but in other respects it is different.
Like Q, it is applicable only to 2 X 2 tables of true dichotomies that have
no gradation in value, or to continuous variables that can be justifiably
dichotomized. Similarly, φ measures the intensity of association which
the distribution of the joint frequencies reflects. As for the formulas
of φ and Q, the numerators are identical, but the denominators are
differently constructed.


$$\phi = \frac{ad - bc}{\sqrt{(a+b)(c+d)(a+c)(b+d)}} \qquad (9.3.1)$$


where a, b, c, and d are the familiar cell frequencies, and the sums of
the individual cell frequencies are the marginal subtotals. This coefficient
takes the sign of the numerator, which carries the same interpretation
as the sign of Q.

In order to demonstrate differences in the performance of Q and φ,
we avail ourselves again of the cross-tabulation of delinquents by sex
in which Q was unity (see Table 9.2.3). Applying the φ formula to the
same data, however, we obtain a measure only half as large (Table 9.3.1).


Table 9.3.1 Delinquents by Sex

               B       G     Total
Del.          20       0       20
Non-del.      30      50       80
Total         50      50      100

$$\phi = \frac{1000 - 0}{\sqrt{20 \cdot 80 \cdot 50 \cdot 50}} = \frac{1000}{2000} = .50$$


Q is necessarily unity, since all delinquents are boys. However, the
converse, that all boys are delinquents, obviously does not hold. Hence,
φ, whose formula is so contrived as to reflect this mutuality of relationship,
is only .50. Association is complete but not perfect, or absolute, terms
used by Yule and Kendall. Clearly, φ and Q do not measure the same
aspects of association in the fourfold table. Essentially, the difference
between the two indexes lies in the fact that the formula for φ is so
designed that the degree of bilateral association is reflected in a single
index. The φ coefficient “picks up” whatever two-way association
there is between the two sets of attributes. Since it measures only the
reciprocal relation between X and Y, it necessarily ascribes equal
influence to both variables; it may therefore be said to be reversible.
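
The contrast between the two indexes can be reproduced directly from the cell frequencies of Table 9.3.1. The following is a minimal sketch; the function names are hypothetical, the Q formula is the usual (ad - bc)/(ad + bc), and the cells are read as a = delinquent boys, b = delinquent girls, c = non-delinquent boys, d = non-delinquent girls.

```python
from math import sqrt

def yule_q(a, b, c, d):
    """Yule's Q: same numerator as phi, denominator ad + bc."""
    return (a * d - b * c) / (a * d + b * c)

def phi(a, b, c, d):
    """Phi coefficient, formula (9.3.1)."""
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

a, b, c, d = 20, 0, 30, 50   # Table 9.3.1

q_value = yule_q(a, b, c, d)
phi_value = phi(a, b, c, d)

# Reversibility: interchanging rows and columns (swapping b and c)
# leaves phi unchanged.
phi_transposed = phi(a, c, b, d)

print(q_value, phi_value, phi_transposed)
```

Q reaches unity because one cell is empty, while φ stays at one half because the product of the four marginal subtotals in its denominator reflects the 30 boys who are not delinquent.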

This may be very simply illustrated (Table 9.3.2) in the case of perfect
two-way association, a situation in which there is no exception in
either direction: all boys are delinquent, and all delinquents are boys.
As a consequence of this perfect two-way association, the joint occurrences
are restricted to one diagonal of the 2 X 2 table, and the other
diagonally located cells are vacant. Since each category completely
“explains” the other, φ must equal unity. To put it arithmetically,
two zeros in one diagonal necessarily produce φ = 1. This principle of
reversibility holds good also for any intermediate value of φ between zero
and unity when the paired attributes only partially explain each other.
As this degree of mutuality diminishes, φ likewise is reduced.


Table 9.3.2 Perfect Two-Way Association

               B       G     Total
Del.          20       0       20
Non-del.       0      80       80
Total         20      80      100

$$\phi = \frac{1600 - 0}{\sqrt{20 \cdot 80 \cdot 20 \cdot 80}} = \frac{1600}{1600} = 1.00$$


Q and φ of course coincide in the presence of perfect two-way association
for the reason that complete one-way association (Q = 1) is a
necessary element in perfect two-way association (φ = 1).


Problem of Marginal Frequencies. Like Q, φ of course reflects gradations
in the intensity of association. Unlike Q, however, φ is responsive to
changes in marginal ratios, since these affect the possible degree of two-way
association. In Table 9.3.3, the sample of delinquents is only one-ninth
as large as that of the non-delinquents. But when the samples
of delinquents and non-delinquents are equalized, φ is raised from .20
to .35, whereas Q would have remained unchanged. This sensitivity of
φ stems from the fact that any uniform inflation of frequencies in one
row alters the ratios in column frequencies, both marginal and internal,
and thereby alters (in this instance, improves) the potentiality of a mutual
relationship. Q and φ are left unaltered if the whole table is multiplied
through by a constant (Table 9.3.3); in consequence, both φ and Q can
be calculated from percentages to the base N.
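
The behavior described here can be sketched numerically. The counts below are hypothetical and are not those of Table 9.3.3; they are chosen only so that the delinquents are one-ninth as numerous as the non-delinquents. The function names are likewise assumptions made for the illustration.

```python
from math import sqrt

def yule_q(a, b, c, d):
    return (a * d - b * c) / (a * d + b * c)

def phi(a, b, c, d):
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

# Hypothetical counts: top row = delinquents (20 cases),
# bottom row = non-delinquents (180 cases).
before = (15, 5, 90, 90)

# Inflate the delinquent row ninefold so that both rows hold 180 cases.
a, b, c, d = before
equalized = (9 * a, 9 * b, c, d)

# Multiply the whole table through by a constant.
scaled = tuple(10 * x for x in before)

for label, table in [("before", before), ("equalized", equalized), ("scaled", scaled)]:
    print(label, round(yule_q(*table), 3), round(phi(*table), 3))
```

In this example Q stays at .50 throughout, whereas φ rises when the rows are equalized and returns to its first value when the whole table is merely scaled.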


Table 9.3.3 Effect of Marginal Distributions


This sensitivity of the index to shifts in marginal ratios may appear
to be a defect of the formula. If a pair of subtotals exerts such an
influence on the value of φ, a statistical worker could rig his coefficient
up or down by tampering with subtotals. Moreover, two or more otherwise
reliable studies may lack comparability merely because of the
uncontrolled factor of subsample size.

Since marginals are occasionally subject to quite arbitrary choice, and
sometimes are necessarily quite unbalanced, one may inquire whether
there is a “natural” or “ideal” marginal ratio. Is perhaps equality in
size of paired samples to be considered the most valid?

This may at first glance appear to be an inviting prospect, since the
resulting 50-50 division of the marginal subtotals would theoretically
allow the cell frequencies to vary maximally, and thereby yield every
possible gradation of φ. But since many sociologically interesting
dichotomies (e.g., delinquent and non-delinquent, married and divorced)
exist in nature only in disproportionate supply, it would be misleading
to set up equal subtotals. And even if an equal division in attributes
were arbitrarily set up for the independent variable, the dependent
variable would still have to be allowed to vary as it may. Thus, in much
social inquiry, there is no escape from unequal, and even disproportionate,
subsamples. In those instances, if the worker has any misgivings about
the validity of his obtained φ-coefficient, he can always quote his entire
tiny table for the information of his reader.

φ of Unity Possible Only with Identical Marginal Sets. It has already
been stated that all correlation formulas are ideally devised so as to
permit the indexes to vary between zero and unity. Any circumstance
which limits the range within which the coefficient can vary blemishes
the norming function of the coefficient, thereby reducing the usefulness
of the index. It is as though percentage distributions were not permitted
to vary between 0 and 100. The φ-coefficient is in exactly that
dilemma when marginal sets are not identical. Under these conditions,
no amount of juggling cell frequencies (Table 9.3.4) will make it possible
to enter zeros in two diagonal cells while maintaining intact the given
observed marginals.


Table 9.3.4 Unlike Marginal Sets and φ


Although identity of marginal sets is necessary for φ to be unity, it
is not sufficient of itself to guarantee perfect correlation. Identical
marginals merely permit the value of φ to range within the ideal limits.
It is always the internal ratios which determine the correlation index, as
is apparent in Table 9.3.5.





Calculation of Maximum φ. Where a φ of unity is impossible to attain,
it may nevertheless seem desirable to determine the maximum possible φ
for the given unlike marginal sets, and this maximum φ may then be
set up as a standard against which to assess the obtained value. The
maximum value of φ may be obtained by placing a zero in one cell, which
guarantees complete one-way association, and then arranging the other
frequencies so as to maximize the divergence between ad and bc. Table
9.3.4 exemplifies this exploratory procedure.

However, instead of resorting to such experimental shuffling, φ_max can
be readily computed for any given marginals by means of the following
formula:

$$\phi_{max} = \sqrt{\frac{s_i}{l_i} \times \frac{l_j}{s_j}}$$

in which l and s = the larger and smaller individual subtotals of
the respective marginal sets
l_i = the largest subtotal in the table
l_j = the larger subtotal in the other set

Applying this formula to the original distribution of Table 9.3.4, we find
that φ_max = .61.

This numerical result corresponds to the φ-value of .61 obtained by trial
and error.
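
Both routes to the maximum can be sketched in a few lines. The sketch assumes that the formula takes the form of the square root of (s_i / l_i)(l_j / s_j) under the definitions just given; the function names are hypothetical, and the second pair of marginals is illustrative rather than taken from Table 9.3.4. The trial-and-error function simply tries every admissible value of cell a and keeps the largest absolute φ.

```python
from math import sqrt

def phi(a, b, c, d):
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

def phi_max(row_totals, col_totals):
    """Maximum phi from the marginal subtotals alone."""
    sets = [sorted(row_totals, reverse=True), sorted(col_totals, reverse=True)]
    # Put first the marginal set that contains the largest subtotal in the table.
    sets.sort(key=lambda pair: pair[0], reverse=True)
    (l_i, s_i), (l_j, s_j) = sets
    return sqrt((s_i / l_i) * (l_j / s_j))

def phi_max_by_trial(row_totals, col_totals):
    """Largest absolute phi over all tables compatible with the marginals."""
    r1, r2 = row_totals
    c1, c2 = col_totals
    best = 0.0
    for a in range(max(0, r1 - c2), min(r1, c1) + 1):
        b, c = r1 - a, c1 - a
        d = r2 - c
        best = max(best, abs(phi(a, b, c, d)))
    return best

# Marginals of Table 9.3.1: rows 20 and 80, columns 50 and 50.
print(phi_max((20, 80), (50, 50)), phi_max_by_trial((20, 80), (50, 50)))

# Illustrative unlike marginals (an assumption, not Table 9.3.4).
print(round(phi_max((80, 20), (60, 40)), 3))
```

For the marginals of Table 9.3.1 both functions give .50, which is why the observed φ of .50 in that table is already the largest value those marginals allow.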

The φ formula does not pose the same problem on the zero end of
the coefficient range. Although φ may show a variable maximum, the
possible minimum is always zero. Any set of marginal ratios, irrespective
of how unbalanced or unlike, is compatible with a zero correlation.
The reason is that any marginal ratio can be theoretically matched within
columns and rows (Table 9.3.6) so as to yield φ = 0. Hence, we have
no occasion to speak of φ_min, but only of φ_max.


Function of φ. The fact that φ can attain a maximum of unity only
when marginal sets are identical strips it of a portion of its norming
capacity, for two or more coefficients are not comparable if they happen
to be based on diverse marginals. It has been suggested that an escape
from this dilemma is to express the observed φ as a proportion of the
maximum possible φ. But such a figure still does not convey any conception
of the degree of association. By that technique, an observed
φ of .2 and a φ_max of .2 would yield a “corrected” φ of unity! This would
certainly be an overstatement of the existing association.

In any event, it is never clear whether a low value of φ is due to the
inhibiting force of the marginal frequencies or to a weak intrinsic relation
between the variables as evidenced by cell frequencies.

Type of Data to Which φ Is Applicable. In the first place, we assume
that the categories are true dichotomies, such as race, religion, employment
status, or other categories which show no gradation in value. However,
continuous data may be compressed into dichotomies and treated
like a pair of attributes, as has been previously suggested. There is
nothing in the mathematics of the φ formula which forbids such utilization
of data, for abstract mathematics is quite oblivious to the meaning
of the substantive empirical data to which the measures refer. The
real problem lies in the interpretation of the data.

Secondly, the distribution of marginal frequencies must be taken into
account in judging the appropriateness of φ. Since there is a question
of the fundamental usefulness of φ when φ_max is appreciably lower than
unity, φ should be very cautiously applied under these circumstances;
and this for two reasons: (1) when φ is applied to unbalanced sets of
marginals, in which two-way association is negligible, it will necessarily
understate the one-way relationship; and (2) a series of φ-values with
varied maxima cease to be comparable. This predicament is serious,
since comparability is one of the basic objectives of norming in the first
place.

What renders the use of φ all the more problematical for application
to social data is the fact that matched marginals occur very rarely in
nature, as has already been implied. We would therefore either have
to abandon the employment of φ except in such rare cases as those in
which identical marginals were present, or tolerate an approximation
when the discrepancy is not too great and the requirements of the problem
not too stringent. Practically speaking, this is a common and justifiable
solution of such a dilemma, since statistics thrives on approximations.

Uses of φ and Q Compared. Since a large proportion of sociological
research deals with attributes, the fourfold table would seem very useful
to sociologists. And in the event that it is desirable to express association
in terms of either Q or φ, the following guiding principles may serve
as rough criteria of choice between them:

(1) When the marginal sets are very unlike, it will be reasonable to use
Q in order to salvage whatever one-way association there is.

(2) When marginal sets are approximately alike, use φ, in order to test the
presence of two-way association.

(3) When φ is low for any reason, try Q.

In spite of its limitations, the φ-coefficient, as distinguished from Q,
possesses certain characteristics which should not be overlooked and tend
to explain its current preferential use over Q. First, its derivation is
statistically more rigorous than that of Q; second, it is mathematically
equivalent to the Pearsonian product-moment correlation (Chapter 10,
Section 3); and third, it is functionally related to chi-square (Chapter 9,
Section 4). However, in spite of a certain elegance, φ is hampered
(as is, for that matter, every other index) by certain characteristics
which limit its practical utility. For all its neatness, it still cannot take
over the functions which the less useful Q can perform better.
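
The second and third properties can be verified on the frequencies of Table 9.3.1. The sketch below is illustrative: the function names are hypothetical, each case is scored 1 or 0 on each attribute (an assumed coding, boy = 1 and delinquent = 1), and the relation to chi-square is taken to be the familiar identity that chi-square equals N times the square of φ.

```python
from math import sqrt

def phi(a, b, c, d):
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

def pearson_r(xs, ys):
    """Product-moment correlation computed from raw scores."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def chi_square(a, b, c, d):
    """Chi-square from observed and chance (expected) cell frequencies."""
    n = a + b + c + d
    rows, cols = (a + b, c + d), (a + c, b + d)
    observed = ((a, b), (c, d))
    total = 0.0
    for i in range(2):
        for j in range(2):
            expected = rows[i] * cols[j] / n
            total += (observed[i][j] - expected) ** 2 / expected
    return total

a, b, c, d = 20, 0, 30, 50   # Table 9.3.1
n = a + b + c + d

# One 0/1 score per case: x = 1 for a boy, y = 1 for a delinquent.
xs = [1] * a + [0] * b + [1] * c + [0] * d
ys = [1] * a + [1] * b + [0] * c + [0] * d

print(phi(a, b, c, d), pearson_r(xs, ys))
print(chi_square(a, b, c, d), n * phi(a, b, c, d) ** 2)
```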

Questions and Problems 

1. While in training in the United States (1943) and later in Europe (1945), the
same group of … soldiers were asked: “In general, how do you feel most of the
time, in good spirits or in low spirits?” Their replies are shown in Table 9.3.7:

Table 9.3.7 Soldier Morale, U.S. (1943) and Europe (1945), Percentage
Distribution

                            1943
1945             Good Spirits   Low Spirits   Total
Good Spirits          27            17          44
Low Spirits            9            47          56
Total                 36            64         100

Source: Samuel A. Stouffer et al., The American Soldier: Adjustment During
Army Life, Vol. I of Studies in Social Psychology in World War II, Princeton
University Press, Princeton, N.J., 1949, p. 184.



(a) What percentage of the total group responded in the same way on both
occasions?

(b) Compute φ.

(c) What percentage of each group changed their response?

(d) Interpret these results: compare the information which the respective
answers yield.

2. The following question was put to 624 high school boys: “Which boy in the
senior class seems to you the most poised in social situations?” Their votes,
classified by religious background of choosers and chosen, are shown in Table
9.3.8a.


Table 9.3.8a Sociometric Choices by Religious Background

                     Chosen
Chooser        Jews    Non-Jews    Total
Jews            239        44       283
Non-Jews         77       264       341
Total           316       308       624

Source: Jackson Toby, “Universalistic and Particularistic Factors in Role
Assignment,” American Sociological Review, XVIII, 1953, p. 134.


(a) Compute φ and …
(b) What does Table 9.3.8a suggest about the effect of religious ethnocentrism?
(c) What proportion of the respective choosers chose their own social group?
(d) What proportion of the total chose their own group?

3. The votes for the ten most popular boys (chosen most frequently) were classified
by religion of choosers and chosen, as shown in Table 9.3.8b.


Table 9.3.8b Sociometric Choices by Religious Background, Ten Most Popular
Boys

                     Chosen
Chooser        Jews    Non-Jews    Total
Jews            124        25       149
Non-Jews         57        55       112
Total           181        80       261


(a) What proportion of the total (261) chose their own group?

(b) What proportion of each group chose their own group?

4. All other boys were classified similarly (Table 9.3.8c).


Table 9.3.8c Sociometric Choices by Religious Background, Boys Not Among
First Ten


(a) Compute φ for Tables 9.3.8b and 9.3.8c.

(b) Would these results modify the conclusion based on Table 9.3.8a?

5. In an investigation of nonliterate cultures it is found that 19 out of 34
agricultural tribes, and 4 out of 24 non-agricultural tribes practice slavery.

Table 9.3.9  Slavery by Type of Economy, 58 Primitive Cultures

               Agricultural   Non-agricultural   Total
Slavery             19                4            23
No slavery          15               20            35
Total               34               24            58

Source: Leo W. Simmons, “Statistical Correlations in the Science of Society,”
in G. P. Murdock, Studies in the Science of Society, Presented to Albert G.
Keller, Yale University Press, New Haven, Conn., 1937, Table 7.

(a) Which index, Q or φ, seems more appropriate to this tabulation?

(b) Which variable would you consider dependent?

(c) Which attributes afford the best prediction of the others?

(d) Prepare percentage distributions in each direction and compare.

6. (a) Compute Q and φ for Table 9.3.10.

(b) Why do they differ so greatly? Is Q preferable?

(c) Compute the maximum φ and compare with observed φ.

(d) Discuss the adequacy of the following summaries of the tabulation:

Color and occupation are correlated.

Color and occupation are positively correlated.

The correlation between color and occupation is +.29.

The correlation between color and occupation, as measured by 0, is S.

More non-white women are employed in domestic service than white women.

Being non-white is associated with being employed as a domestic.

A larger proportion of non-white women are in domestic service than
white women.

In 400,000 employed women in Chicago, 1940, being non-white is associated
with being employed in domestic service and the degree of that
association, as measured is S.

Table 9.3.10  Type of Occupation by Race, Employed Females (in '000),
              Chicago, 1940

                        Occupation
               Other than
               Domestic     Domestic
Color          Service      Service     Total
White            359           23        382
Non-white         17           10         27
Total            376           33        409

Source: O. D. Duncan and Beverly Davis, “An Alternative to Ecological
Correlation,” American Sociological Review, XVIII, 1953, p. W.


7. Investigators interested in political attitudes asked a group of people the
following question: “If you had to choose for president between a man who has
had experience in government and a man who has had experience in business,
which would you choose?” Table 9.3.11 presents a summary of the replies
classified by major political parties.

Table 9.3.11  Preferred Occupational Background of President by Major
              Political Parties, 19iS

                  Party of Respondent
Response       Democratic   Republican   Total
Business            34          161        195
Government         115           47        162
Total              149          208        357

(c) Discuss in your own words the association between party affiliation and
type of person preferred for president.

8. In general, which seems to be more informative, the analysis of rows and
columns, or the comprehensive index?


Section Four 

Coefficient of Contingency 

Yule’s Q and φ apply to dichotomous classifications, but it is
often not only undesirable, but even impossible, to reduce sociological
data to a dichotomy. Although male-female, insane-sane, yes-no data
may logically fall into a 2 × 2 table, the equally prevalent socio-economic
classes, religious denominations, types of crime, gradations of attitude,
and many other social groupings require more than two categories. To
cross-tabulate such polytomous classifications, a 2 × 2 contingency table
is insufficient; a manifold table is required.

We shall consider a formula which answers the requirement of such
manifold distributions, namely, Karl Pearson’s coefficient of contingency,
usually abbreviated to C. It is based on the deviations of the observed
cell frequencies from those frequencies expected on the assumption of
chance, as measured by χ² (read “chi-square”).

Chi-Square. The statistic χ² has a rather elaborate mathematical
grounding, and yet it is a very conventional quantity, useful in those
statistical situations where it is necessary to measure the discrepancy
between the observed and expected frequencies. Here, however, it will
be compactly set forth for practical purposes without fundamental
explanation, since it involves principles which may not be readily
comprehensible at this stage of the student’s development. The steps in
the computation of χ² are as follows:

(1) Compute the expected frequencies (E).

(2) Subtract the expected from the observed frequencies (O − E).

(3) Square each difference, (O − E)².

(4) Divide each squared discrepancy by its expected frequency,
(O − E)²/E, thereby norming each absolute discrepancy on its own base.

(5) Sum the resulting ratios, which yields, by definition, χ².

The formula will therefore read:

$$\chi^2 = \sum \frac{(O - E)^2}{E} \tag{9.4.1}$$



Given a joint frequency table such as the following, the calculation of
χ² would proceed as shown in Table 9.4.1.

            Col. 1   Col. 2   Total
Row 1         18        2       20
Row 2          6       34       40
Total         24       36       60


Table 9.4.1  Computation of χ², 2 × 2 Contingency Table, Unlabeled

Intersection of
Row and Col.       O      E     O − E   (O − E)²   (O − E)²/E
1  1              18      8      10       100        12.50
1  2               2     12     −10       100         8.33
2  1               6     16     −10       100         6.25
2  2              34     24      10       100         4.17
Total             60     60       0                  31.25

Computation of Expected Frequencies. The expected frequencies are
computed in exactly the same manner as has already been explained in Section
2 of this chapter. As was then defined, chance frequencies in rows and
columns are proportional to corresponding marginal totals. In the above
example, the proportion of all items in the first row is 1/3; hence, the chance
frequencies in the first row would be one-third of the respective column
totals. Thus, the expected frequency in Row 1, Column 1, is (1/3) × (24)
= 8. One could compute the other expected frequencies by subtraction
from the marginal subtotals. A prudent check on accuracy, however,
can be secured by independently computing all cell frequencies in the
aforesaid manner, and totaling them to determine whether they
correspond to the marginal totals. The expected frequency of any cell may
be routinely calculated by multiplying its marginal frequencies together
and dividing by N. In symbols:

$$E_{ij} = \frac{r_i c_j}{N}$$

where r_i = marginal frequency of ith row

      c_j = marginal frequency of jth column
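The five steps and the expected-frequency rule can be sketched in a few lines of Python. This is only an illustration, not part of the original text: the function names are hypothetical, and the cell values 18, 2, 6, 34 are assumed from the discrepancies and expected frequencies shown in Table 9.4.1.

```python
def expected_frequencies(observed):
    """Chance frequencies: row total times column total, divided by N."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def chi_square(observed):
    """Sum of (O - E)^2 / E over every cell of the table."""
    expected = expected_frequencies(observed)
    return sum(
        (o - e) ** 2 / e
        for o_row, e_row in zip(observed, expected)
        for o, e in zip(o_row, e_row)
    )


# Assumed joint frequencies of the worked 2 x 2 example.
table = [[18, 2],
         [6, 34]]

print(expected_frequencies(table))   # [[8.0, 12.0], [16.0, 24.0]]
print(round(chi_square(table), 2))   # 31.25
```

The expected frequencies sum to the same marginal totals as the observed ones, which is the accuracy check recommended above.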

Coefficient of Contingency. The coefficient of contingency is designed to
measure the degree of contingency, or dependence, between two variables
or traits or attributes. Now, since pure chance distribution indicates no
association at all, the more nearly this pure chance distribution
approximates the observed distribution, the weaker the affinity must
necessarily be; similarly, the greater the discrepancy between the observed
and chance distributions, the greater must be the association, or
dependence, between the variables. Since χ² has been chosen as the measure
of this discrepancy, the higher the χ²-value, the greater the association;
χ² could therefore be accepted as a rough measure of correlation.

However, it cannot qualify as a standard correlation measure, since
its upper limit varies directly with the number of observations, N, so
that successive instances of χ² will be lacking in comparability. In
other words, raw χ²-values are not normed; they do not range from
zero to unity, and therefore are not suited for a measure of correlation
in the accepted sense.

But the following formula, which allows for varying N's, more or less
satisfies this requirement of a standard range:

$$C = \sqrt{\frac{\chi^2}{\chi^2 + N}} \tag{9.4.2}$$

It will be clear upon examination of the formula that if χ² is large in
relation to N, C will approach unity, since numerator and denominator
will be virtually equal; however, if χ² is small in relation to N, the
coefficient will also be small and tend towards zero. If χ² = 0 (that is,
if there is no discrepancy between the observed data and pure chance),
the coefficient will, of course, also be zero because the numerator is zero.
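As a small numerical illustration of this behavior, the sketch below evaluates C for the worked 2 × 2 example, taking χ² as the sum of the four ratios in Table 9.4.1 (12.50 + 8.33 + 6.25 + 4.17 = 31.25) with N = 60. The function name is hypothetical and not from the text.

```python
import math


def contingency_coefficient(chi2, n):
    """Pearson's C: square root of chi-square over (chi-square + N)."""
    return math.sqrt(chi2 / (chi2 + n))


# Worked 2 x 2 example: chi-square = 31.25, N = 60.
print(round(contingency_coefficient(31.25, 60), 3))  # 0.585

# No discrepancy from chance gives a coefficient of zero.
print(contingency_coefficient(0.0, 60))              # 0.0
```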

Demonstration of C. In order to comprehend the type of association
measured by the contingency coefficient, let us set up a hypothetical
case of obviously perfect correlation between class membership and
political opinion which is not disturbed by exceptions, and a similar
tabulation which is not perfect. In Table 9.4.2a, all members of the
highest class are conservative, and all conservatives are in the highest
class without exception in either direction, and so on through the
remaining categories. The absence of exceptions to this generalization is
indicated by the six zero entries. No greater perfection of association
can be conceived, no matter what the marginal distributions, so long
as they are identical. As members of the social classes move down the
scale, they also move uniformly across to the radical end of the opinion
scale.

However, in Table 9.4.2b, not all of the frequencies fall on the diagonal;
but the dominant trend along the diagonal may still be discerned, since
the larger frequencies are concentrated there. Nevertheless, a
considerable number of persons diverge from the observed trend. In Table
9.4.2a the prediction that a person of Class I would be conservative
would be correct 100 per cent of the time. On the other hand, in Table
9.4.2b, such a prediction would be much less certain, since it is correct
only 10 out of 15 times. The coefficient of contingency would
accordingly be lower, and is found to be .57.

Table 9.4.2a  Computation of Contingency Coefficient, Perfect Association,
              Political Opinion by Social Class

                    Observed                        Expected
Social        Political Attitude:             Political Attitude:
Class         Cons.  Neut.  Rad.   Total      Cons.  Neut.  Rad.   Total
High           13      0      0     13         3.8    4.3    4.9     13
Middle          0     15      0     15         4.3    5.0    5.7     15
Low             0      0     17     17         4.9    5.7    6.4     17
Total          13     15     17     45         13     15     17      45

χ² = 90

$$C = \sqrt{\frac{90}{90 + 45}} = .816$$

Source: Hypothetical
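The two hypothetical tabulations can be checked with a short Python sketch, offered only as an illustration (the function name is an assumption). It uses unrounded expected frequencies, so χ² for the imperfect table comes out near 24.4 rather than the 24.19 obtained from expected values rounded to one decimal; the coefficient still rounds to .57.

```python
import math


def chi_square_and_c(observed):
    """Return (chi-square, C) for a table of observed frequencies."""
    rows = [sum(r) for r in observed]
    cols = [sum(c) for c in zip(*observed)]
    n = sum(rows)
    chi2 = 0.0
    for r_total, row in zip(rows, observed):
        for c_total, o in zip(cols, row):
            e = r_total * c_total / n      # expected (chance) frequency
            chi2 += (o - e) ** 2 / e
    return chi2, math.sqrt(chi2 / (chi2 + n))


perfect = [[13, 0, 0], [0, 15, 0], [0, 0, 17]]      # Table 9.4.2a
imperfect = [[10, 3, 2], [7, 8, 1], [2, 4, 14]]     # Table 9.4.2b

chi2_a, c_a = chi_square_and_c(perfect)
chi2_b, c_b = chi_square_and_c(imperfect)
print(round(chi2_a, 2), round(c_a, 3))   # 90.0 0.816
print(round(chi2_b, 1), round(c_b, 2))   # 24.4 0.57
```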

Formula Understates Degree of Association. But C does not function
perfectly according to convention. The formula is not so constructed
that it can successfully yield unity, even though the intrinsic relation
between the attributes be perfect. Thus, in spite of the evident perfect
correlation in Table 9.4.2a, the obtained coefficient was only .816 instead
of unity. To increase the complications, this maximum obtainable index
(C_max), the equivalent of perfect correlation, varies with the number of
cells in the table.

It may be shown that we can calculate the theoretical C_max on the sole
basis of the number of rows (or columns), provided the table is square
and marginal sets are identical. For square tables, we find that:

$$C_{max} = \sqrt{\frac{t - 1}{t}} \tag{9.4.3}$$

where t = the number of rows



Table 9.4.2b  Computation of Contingency Coefficient

                    Observed                        Expected
Social        Political Attitude:             Political Attitude:
Class         Cons.  Neut.  Rad.   Total      Cons.  Neut.  Rad.   Total
High           10      3      2     15         5.6    4.4    5.0     15
Middle          7      8      1     16         6.0    4.7    5.3     16
Low             2      4     14     20         7.4    5.9    6.7     20
Total          19     15     17     51         19     15     17      51

Row and Col.     O       E     O − E   (O − E)²   (O − E)²/E
1  1            10      5.6     4.4     19.36        3.46
1  2             3      4.4    −1.4      1.96         .45
1  3             2      5.0    −3.0      9.00        1.80
2  1             7      6.0     1.0      1.00         .17
2  2             8      4.7     3.3     10.89        2.32
2  3             1      5.3    −4.3     18.49        3.49
3  1             2      7.4    −5.4     29.16        3.94
3  2             4      5.9    −1.9      3.61         .61
3  3            14      6.7     7.3     53.29        7.95
Total           51     51.0     0.0                 24.19

$$C = \sqrt{\frac{24.19}{24.19 + 51}} = .57$$

Source: Hypothetical


perfect, provided marginal sets are identical. The ratio between the
obtained and the maximum C would therefore be roughly equivalent to
the conventional measure of association, ranging from 0 to 1. Applying
this reasoning to Table 9.4.2a where the obtained C was .816, we would
compute the adjusted C:

$$\frac{C}{C_{max}} = \frac{.816}{.816} = 1.00$$

which is obviously correct.

This adjustment becomes smaller and smaller, and therefore
decreasingly necessary, as the dimensions of the table increase.
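The adjustment, and the way it shrinks as the table grows, can be illustrated with a brief sketch. The function names are hypothetical, and the ceiling is assumed to be the square-table value, the square root of (t − 1)/t.

```python
import math


def c_max(t):
    """Ceiling of C for a t x t table with identical marginal sets."""
    return math.sqrt((t - 1) / t)


def adjusted_c(c, t):
    """Obtained C expressed as a fraction of its ceiling."""
    return c / c_max(t)


# Table 9.4.2a: obtained C of .816 in a 3 x 3 table.
print(round(adjusted_c(0.816, 3), 2))    # 1.0

# The ceiling approaches unity as the number of rows increases.
for t in (2, 3, 5, 10):
    print(t, round(c_max(t), 3))         # 0.707, 0.816, 0.894, 0.949
```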


The Sign of C. The C-coefficient normally carries no sign, and this for
two simple reasons. (1) Since C is the square root of a number, it may
be either plus or minus; hence, no definite sign is imposed on the index
on strictly mathematical grounds. (2) In polytomous tables, the joint
frequencies need not be restricted to the diagonal cells, even though the
correlation is perfect; hence, the sign is meaningless. However, if the
categories may be ordered from high to low, there is no objection to
establishing the sign by inspection. Thus, in the present example, we may
speak of a high positive correlation between class status and degree of
political conservatism, since as standing rises, intensity of
conservatism also increases.

Conditions Under Which C Is Appropriate. (1) As in the case of Q and
φ, C is designed to measure association between qualitative variables.
(2) The correlation between two series of data is most easily verbalized
if the categories can be ordered (formulated in graded sequence): for
example, social classes ranked from high to low, or social attitudes ranked
from most favorable to least favorable. This ordering, however, is not
a mathematical requirement. Since χ² is merely the sum of all the normed
discrepancies between observed and expected frequencies, their order is
technically immaterial. Furthermore, such categories as race and
religious denomination do not usually lend themselves to natural ordering,
but they are not thereby excluded from consideration for this type of
correlation. In such an instance, the index loses nothing in validity,
although the interpretation is likely to be awkward and verbose. (3)
Because C_max approaches unity only when the number
of cells is large, it is sometimes recommended that C be computed only
for square tables of at least 5 × 5 dimension.

As originally formulated by Karl Pearson, the C-coefficient for
attributes was put forward as an approximation of the product-moment
correlation coefficient for continuous variables. However, the demanding
assumption underlying this equivalence (normally distributed variables
which assure a linear relationship) is almost prohibitive for sociological
data.

The general trend is, therefore, toward a broadening of its scope of
application beyond that originally conceived by its inventor. What C
thereby loses in precision of interpretation, it gains in resourcefulness.

ia vvam/znlij xss.’pAtgxi ixfgai tiSsamTAiiwa xS feixarily xa 

of normality of marginal distributions in rectangular tables of
continuous, discrete, or qualitative data. Its limitations are evident, however,
from the fact that only square tables can yield a perfect correlation
(otherwise marginals cannot be identical); and indexes drawn from
unlike tables are not comparable.




Table 9.4.4  Marital Adjustment of Husbands by Degree of Attachment to Father

Degree of                    Marital Adjustment
Attachment           Poor        Fair        Good       Total
Little or none      32 (20)     28 (  )     15 (  )       75
Moderate            41 (43)     47 (  )     69 (  )      157
A good deal         26 (35)     41 (  )     61 (  )      128
Very close          28 (  )     22 (  )     59 (  )      109
Total                 127         138         204        469

Source: Ernest W. Burgess and Leonard S. Cottrell, Jr., Predicting
Success or Failure in Marriage, Prentice-Hall, Inc., Englewood
Cliffs, N.J., 1939, p. 377.





unless the association is perfect. As in so many other situations, we must
be content with approximations.

Questions and Problems

1. Define the following concepts:

Coefficient of Contingency
Contingency Table
Polytomous Classification
Chi-Square
C_max

2. (a) Compute C for Table 9.4.3.

(b) Why is C difficult to interpret for this table?

Table 9.4.3  Peer Ratings by Social Class, High School Students

                         Social Class
Peer Rating     I and II    III     IV      V     Total
Elite              27        30                     66
Good Kids           8       114    133             259
Grubby Gang         0         2     41     22       65
Total              35       148    183     26      390

Reprinted with permission from A. B. Hollingshead, Elmtown's
Youth, p. 223. Copyright 1949, John Wiley & Sons, Inc.

3. (a) Calculate the remaining expected frequencies in Table 9.4.4.

(b) Compute C.

(c) Describe verbally the relationship between variables.

(d) Compute C_max by combining degrees of attachment (little and moderate)
for a square table. Compare with C.

4. Compute C_max for 4 × 4 and 8 × 8 tables.

5. Compute Q, φ, and C for the following 2 × 2 table. Compare results and
discuss the differences.



rank order is based on the concept “more or less,” even though the
“unit of measure” is subjective, we may conceive of the collective items
ranked as a quantitative, rather than qualitative variable; and when
two sets of ranks vary together, we speak of rank-order correlation.

Rank-Difference Method of Correlation (ρ). The simplest correlation of
ranks is that between only two sets of ranks. When the number of items
ranked is no more than a half dozen, an observer might by inspection
be able to draw a rough but acceptable conclusion on the degree of
correlation between them.

Consider the two series of rankings of six pictures by two judges shown
in Table 10.1.1a. Because the two rankings are duplicates, the
discrepancies between the paired rankings are all zero. Furthermore, there is
perfect prediction from one rank order to the other. The measure of
correlation between ranks would therefore logically be unity.

Table 10.1.1a  Rank-Order Correlation: Six Items, Two Judges;
               Perfect Positive Correlation

Picture    Judge X    Judge Y    Difference (D)
A             1          1             0
B             2          2             0
C             3          3             0
D             4          4             0
E             5          5             0
F             6          6             0

On the other hand, one set of ranks could be arranged from low to
high (Table 10.1.1b) instead of from high to low without affecting the
degree of predictability. While the two series of ranks are now in exactly
reverse order, the deviation of each rank relative to the mean rank is
unchanged, and therefore mutual predictability remains unchanged.
Hence, the measure of relation is still unity, but now negative.


Table 10.1.1b  Perfect Negative Correlation





Covariation:
Quantitative Variables


Section One

Correlation of Ranks: Rho


Concept of Covariation. Heretofore, correlation procedures were
restricted to measuring the relationship among qualitative variables on
the basis of joint frequencies. But quantitative variables, too, may be
associated, and are then said to covary: a change in one variable is
paralleled by a change in the other. Birth rates and family income, or
suicide rates and proportion of Protestants, may be linked together, and
the degree of association measured by an appropriate index of
correlation. Of the many correlation indexes which have been devised, only
three will here be elaborated, each corresponding to a given type of data
and pattern of relationship: (1) paired ranks (rho), (2) linearly related
series (r), and (3) curvilinearly related series (eta).

Measurement by Ranks. A very simple ordering of data is in the form
of ranks (Chapter 2, Section 1). The simplicity consists in the fact
that measurement may be intuitive and subjective, rather than by
palpable, objective units of measure. Such rankings may not be very
precise, and yet they may be very useful. In fact, many sociological concepts
cannot be conveniently manipulated in any other manner.
Consequently, a considerable volume of social research is founded on such a
subjective base. Thus, occupations may be ranked by prestige; fellow
students may be ranked in order of preference; races and nationalities
may be ranked by favorable or unfavorable prejudice; pictures may be
ranked by aesthetic value. Even when units of measure are available
(as, for example, in the sizes of cities or the magnitudes of birth rates),
ranking may be resorted to when such precision is not required. Since

A third possible relationship, between these two extremes, would
be the chance relation, in which the rank of X cannot be predicted from
the rank of Y (Table 10.1.1c): any X-rank is equally likely to be paired
with any Y-rank. In this case, at least theoretically, the correlation
would be zero.


Table 10.1.1c  Chance



A formula to measure the degree of correlation between two sets of
ranks, and normed to yield the conventional range of zero to unity, was
derived by Charles Spearman in 1904:

$$\rho = 1 - \frac{6 \sum D^2}{N(N^2 - 1)} \tag{10.1.1}$$

where D = difference between paired ranks
      N = number of items ranked

Solving for the three examples just given, we obtain the following
coefficients:

Table 10.1.1a:  ρ = 1 − 6(0)/6(35)  = 1 − 0   = 1
Table 10.1.1b:  ρ = 1 − 6(70)/6(35) = 1 − 2   = −1
Table 10.1.1c:  ρ = 1 − 6(34)/6(35) = 1 − .97 = .03
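The rank-difference formula is easily sketched in Python. The function name is hypothetical, the formula is assumed in its usual untied form (one minus six times the sum of squared rank differences, over N(N² − 1)), and the scrambled ranking for the chance case is an invented illustration rather than the data of Table 10.1.1c.

```python
def spearman_rho(x_ranks, y_ranks):
    """rho = 1 - 6 * sum(D^2) / (N * (N^2 - 1)), for untied ranks."""
    n = len(x_ranks)
    sum_d2 = sum((x - y) ** 2 for x, y in zip(x_ranks, y_ranks))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))


judge_x = [1, 2, 3, 4, 5, 6]

# Identical rankings: every difference is zero.
print(spearman_rho(judge_x, [1, 2, 3, 4, 5, 6]))    # 1.0

# Exactly reversed rankings: sum of D^2 is 70.
print(spearman_rho(judge_x, [6, 5, 4, 3, 2, 1]))    # -1.0

# Hypothetical scrambled ranking with sum of D^2 equal to 34.
scrambled = [3, 6, 1, 4, 2, 5]
print(round(spearman_rho(judge_x, scrambled), 2))   # 0.03
```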


Procedure in Case of Tied Ranks. Ranks occasionally will be tied. A
puzzled judge may rate two pictures identically; quantitative scores
and measures may also tie. In such cases two or more items may seem
to share the same rank. Since the number of ranks and the number of
items must coincide, it will simply not be possible for two items to occupy
the same rank, so they must be given adjoining ranks. In such instances,
both items will be assigned the arithmetic average of the two adjoining
ranks. Thus, if the third and fourth items are tied, each is given the
rank of 3.5. If three or more items are tied, the same rule of averaging




judges ranking the same exhibit of pictures may be at wholly different
locations on the complete continuum of taste preference, but still rank
the art works in identical order. Judge I may regard them all as on a
high level of excellence, while Judge II may see them collectively as a
poor lot. The judges may agree on the order, but disagree on the
qualities themselves. Nevertheless, in many actual situations, it is reasonable
to suppose that similar rankings do correspond to similar preferences,
owing to the fact that by and large individuals share a common culture.
When, however, the two series are not on a single continuous scale, as
in the case of rental and income, or when they are made up of
incommensurable categories, as, for example, birth rates and income, this
interpretative problem of scale location does not, of course, arise. Still and all,
the worker should be certain, before employing ρ, that he is interested
in correlating only rank orders rather than actual magnitudes.

A Measure of Agreement in Three or More Ranks. The formula for ρ
can accommodate only two ranks; however, the data may consist of three
or more sets of ranks. One method of summarizing the degree of
agreement among three or more sets of rankings is simply to compute the
mean of all possible ρ's. Thus, if three judges were to rank six pictures,
we would compute ρ for every possible combination of paired sets in
order to determine the average agreement among them. The paired sets
would consist of the following combinations: Judges I and II, II and
III, and I and III. The three ρ-values would then be averaged, signs
observed. The result of this operation is sometimes called the average
intercorrelation of ranks; however, it might be more properly called, for
reasons given below, a coefficient of agreement.

Table 10.1.3 presents hypothetical rankings by three different judges.
The average of the three ρ-values indicates only a moderate degree of
agreement. Now let us suppose that we have three ρ-values of 1, −1,
and −1. The three perfect correlations do not yield an average of 1.00
as one might naively expect, but only an average of −.33. This is
comprehensible only if we view this average as a statement of degree


Table 10.1.3  Hypothetical Rankings by Three Judges

Read “rho-bar.”




Figure 10.1.1  Scatter Diagram, Monthly Rent and Family Income


ready had occasion to note that rank correlation is not based strictly
on a correspondence between rank orders, but rather on the
correspondence between deviations of the respective ranks from the mean rank.
These deviations are measured in terms of sigma units. The formula
automatically transforms ranks into standard deviates and hence actual
differences between ranks (D) into σ-differences. Therefore, we may
confidently feed the original rank-order data into the machine-like
formula.

However, the above set of operations proceeds on the assumption that
intervals between ordinal ranks are equal. For instance, the gap between
Ranks 1 and 2 is assumed to be equal to that between Ranks 2 and 3.
And yet, our common sense tells us that the ideal circumstance of neat
equal intervals which the formula requires can never be matched by the
brute data which are ranked. The actual degree of subjective
preference of first over second rank is not necessarily of the same intensity as
third choice over second. Horses may rank in a certain order of passing
the finish line, but the intervals between them are not the same.
Nevertheless, ρ may be, and is, used when units of measure are unavailable,
or when the discrimination between unequal intervals among variates is
not considered essential.

From this it is evident that ρ measures the correlation between ordinal
ranks, and not the correlation between the potential magnitudes that
are being ranked. Hence, ρ in general overstates the degree of congruity
existing between the raw variates, expressed or unexpressed. Thus, two


Table 10.1.4  Average Intercorrelation of Ranks

  I     II    III      S      S²
  1      3     4       8      64
  2      2     6      10     100
  3      1     1       5      25
  4      4     2      10     100
  5      5     3      13     169
  6      6     5      17     289
                             747

$$\bar{\rho} = 1 - \left[\frac{3(24 + 2)}{(2)(5)} - \frac{12(747)}{3(2)(6)(35)}\right]$$

$$= 1 - \left[\frac{78}{10} - \frac{249}{35}\right]$$

$$= 1 - [7.80 - 7.11]$$

$$= .31$$


Questions and Problems

1. Define the following concepts:

Rank
Rank Order
Equal Intervals
Ordinal Numbers
Rank-Order Correlation

2. State the two fundamental difficulties in interpreting rank orders as
indicators of the social realities which they purport to represent.

3. Under what circumstances can qualitative data be ranked?

4. Are differences between adjacent ranks necessarily equal? Illustrate.

5. Differentiate between agreement and correlation.

6. Verify by an example that tied ranks reduce rank-order correlation.

7. Two persons ranked six occupations by degree of prestige in the following
orders. Compute ρ.

Minister           Physician
College Teacher    Banker
Banker             Lawyer
Lawyer             Engineer
Physician          Minister
Engineer           College Teacher

8. Four persons ranked the following items according to their conception of the
degree of social distance registered by an endorsement. Compute the average
intercorrelation. “I would be willing to have members of a given race:

(a) live in my neighborhood.”

(b) live in this country.”

of agreement, rather than of correlation. In this case of three perfect
correlations, the predominant relation is one of disagreement, averaged
with one of perfect agreement.

Judges I and II       ρ = 1
Judges II and III     ρ = −1
Judges I and III      ρ = −1
                     Σρ = −1
                      ρ̄ = −.33

Although the averaging of indexes of any type is usually fraught with
pitfalls, it is not unjustifiable in this instance, for the reason that all
the ρ-values are equally weighted, all being made up of the same number
of items. It should be reiterated, however, that this average is not itself
a coefficient of correlation; it is an average of several coefficients. It
could never yield a minus unity, for the simple reason that if two series
were correlated −1, a third could not be so correlated to both of the
preceding. An average of plus unity, however, could be achieved in case
of universal agreement.

When the ρ-values to be averaged are numerous, the computations
will become laborious. But a shorter method of averaging ρ's has been
devised,* which is particularly useful when the number of series is large.
The formula is not as formidable as it appears at first view, since all of
its terms are quite conventional quantities.

$$\bar{\rho} = 1 - \left[\frac{a(4N + 2)}{(a - 1)(N - 1)} - \frac{12 \sum S^2}{a(a - 1)N(N^2 - 1)}\right] \tag{10.1.2}$$

in which a = the number of sets
         N = the number of items ranked
         S = the sum of individual ranks of each item

Table 10.1.4 applies this formula to the data of Table 10.1.3 and the
same ρ̄-value is obtained (.31).
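The agreement between the two routes can be checked with a short sketch. It is only an illustration: the function names are hypothetical, the rankings are the three columns of Table 10.1.4, and the short formula is assumed in the form 1 − [a(4N + 2)/((a − 1)(N − 1)) − 12ΣS²/(a(a − 1)N(N² − 1))].

```python
from itertools import combinations


def spearman_rho(x, y):
    """Rank-difference correlation for two untied rankings."""
    n = len(x)
    return 1 - 6 * sum((p - q) ** 2 for p, q in zip(x, y)) / (n * (n ** 2 - 1))


def mean_rho_pairwise(rankings):
    """Long method: average rho over every pair of rankings."""
    pairs = list(combinations(rankings, 2))
    return sum(spearman_rho(x, y) for x, y in pairs) / len(pairs)


def mean_rho_short(rankings):
    """Short method: uses only the rank sums S of each item."""
    a, n = len(rankings), len(rankings[0])
    sum_s2 = sum(sum(item) ** 2 for item in zip(*rankings))
    return 1 - (a * (4 * n + 2) / ((a - 1) * (n - 1))
                - 12 * sum_s2 / (a * (a - 1) * n * (n ** 2 - 1)))


judges = [[1, 2, 3, 4, 5, 6],     # Judge I
          [3, 2, 1, 4, 5, 6],     # Judge II
          [4, 6, 1, 2, 3, 5]]     # Judge III
print(round(mean_rho_pairwise(judges), 2))   # 0.31
print(round(mean_rho_short(judges), 2))      # 0.31

# Two identical rankings and one reversed give rho-values of 1, -1, -1.
opposed = [[1, 2, 3, 4, 5, 6], [1, 2, 3, 4, 5, 6], [6, 5, 4, 3, 2, 1]]
print(round(mean_rho_short(opposed), 2))     # -0.33
```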

Utility of Rho. Since ρ applies to ordinal data, without specific units
of measure, it fills an important need of the social scientist who must
deal in such coarse quantitative measures, which are frequently of a
subjective character. Sociometric rankings of preferences, aesthetic
judgments, and the like may all be correlated in order to determine the
degree of agreement among the ratings of selectors and judges.
Occupations may be ranked by socio-economic level, and by their respective
delinquency or divorce rates. Hence, a measure of presumptive relation
between socio-economic level and incidence of divorce could thereby be
obtained.

* Charles C. Peters and Walter R. Van Voorhis, Statistical Procedures and Their
Mathematical Bases, McGraw-Hill Book Co., New York, 1940, p. 201.



12. Calculate average intercorrelation by “short” method for Table 10.1.7 and
interpret.

13. Baseball fans like to argue the question: “Which is more important for a team,
offensive strength (i.e., the ability to hit and score runs) or defensive strength
(i.e., good pitching and fielding, the ability to keep the other team from
scoring runs)?” Table 10.1.8 shows the final standings of the major league
teams in 1959, the number of runs scored by each team, and the number of
runs scored against each team. Compute the rank-order correlation, for each
league separately, between (a) final standing and runs scored and (b) final
standing and runs yielded to the other teams. On the basis of these
ρ-values, which factor seems to have been more important in the major leagues
in 1959: offense or defense?


Table 10.1.6  Selected States Ranked for Efficiency in Elementary Education in
1860 and for Number of Persons per 1,000 Population Listed in Who's Who?, 1910

State              Education    Who's Who?
Alabama               24            23
Arkansas              29            27
Connecticut            2             2
Delaware               8            19
Florida               27            29
Georgia               25            25
Illinois              14            14
Indiana               17            16
Iowa                  16            12
Kentucky              20            22
Louisiana             26            20
Maine                  6             4
Maryland              13            15
Massachusetts          1             1
Michigan              11             9
Mississippi           28            17
Missouri              19            18
New Hampshire          5             5
New Jersey             9            13
New York               7             7
North Carolina        22            28
Ohio                  10            11
Pennsylvania          12            10
Rhode Island           4
South Carolina        21            21
Tennessee             23            24
Vermont                3             3
Virginia              18            26
Wisconsin             15             6

Source: J. M. Reinhardt and G. R. Davies, Principles and Methods of Sociology,
Prentice-Hall, Inc., Englewood Cliffs, 1932, p. 71.



Covariation: Quantitative Variables


(c) only as speaking acquaintances."

(d) as husband (wife)."

(e) as work companions in the same plant or store."

(f) as personal physician."

  I     II    III   IV
  a     d     b     a
  b     b     c     e
  c     c     d     b
  d     a     f     d
  e     f     e     e
  f     e     a     f

9. Ten race-horses, designated A, B, C, D, E, F, G, H, I, J, finish in the following
order: H, C, D, F, G, A, J, B, E, I. Their position at the post was as follows:
D, F, I, E, J, A, C, G, B, H. Correlate positions at the post with their finishing
order.

10. Compute ρ between income and psychosis rate; occupational prestige and
psychosis rate (Table 10.1.5).

11. Compute ρ for Table 10.1.6, and interpret.


Table 10.1.5  Occupational Groups Ranked by Income, Social Prestige, and
Psychosis Rate

Occupational Group                            Income   Prestige   Psychosis Rate
Peddlers                                        17        17            2
Waiters                                         16        16            1
Domestics                                       15        14            4
Barbers, beauticians                            14        13            7
Semi-skilled and unskilled                      13        15            3
Salesmen                                        12         9            8
Skilled workers                                 11        12            6
Office employees                                10         8           14
Semi-professional (druggists, osteopaths)        9         4            9
Small tradesmen                                  8         5           15
Sub-executives                                   7         6           10
Policemen, firemen                               6                     13
Major salesmen                                   5         7           16
Minor government employees                       4        11            5
Clergy, teachers, social workers                 3         2           12
Technical careers                                2         3           11
Large owners, doctors, lawyers, dentists         1         1           17

Source: Robert C. Clark, "Psychoses, Income and Occupational Prestige," American
Journal of Sociology, LIV, 1949, p. 433. Copyright by the University of Chicago.






§ 10.2 Scatter Diagram and Correlation Table


Section Two 

Scatter Diagram and Correlation Table

In the previous section, we were concerned with the problem of measuring
the amount of agreement between ranks, or ordinal measures.
However, there are numerous concepts in sociology that are subject to
interval measurement: for example, we may measure fertility in terms of
the number of births per 100 women or wages in terms of dollars and
cents. When the data being correlated consist of scaled variables, we
naturally employ techniques that are appropriate to that type of quantitative
data. While these correlational techniques differ somewhat from
those previously presented, they all answer to the same purpose: namely,
to express as precisely as possible the degree of relationship between two
or more variables. The ensuing discussion is restricted to the most
prevalent of all correlation measures: the product-moment correlation
coefficient (r) and the correlation ratio (η).

The Scatter Diagram. The measurement of covariation can be approached
in a preliminary manner by means of the scatter diagram, which is related
to bivariate data in much the same way as the histogram is to univariate
data. It reveals at a glance the entire disposition of items, thereby
enabling us to arrive at a rough but useful estimate of the strength of
the correlation before we actually measure it. This indispensable device
has already been employed to depict the trend of a time series (p. 69)
and the degree of correspondence between ranks (p. 274); however, the
details of its construction have not yet been elaborated.

To illustrate more fully its construction and use, we take as our point
of departure Table 10.2.1, which presents yearly income averages and
suicide rates for the U.S. Central states, 1929-1950. To reduce the data
of this table to a scatter diagram, we first draw horizontal and vertical
axes, as in the construction of a histogram. Axes are drawn approximately
equal in length, unless there is good reason to deviate from this
convention. Also, as a matter of convention, the independent X-variable
is plotted along the base line, and the dependent Y-variable along the
vertical axis. The establishment of one variable as the independent, and
the other as dependent, is, of course, not a statistical problem, but rather
a matter of judgment and circumstance. The dependent variable may
be construed as the effect of the independent variable as cause, or the
outcome to be predicted from the predictor variable. In many instances,
there may be no clear "causal" dependency at all, since both variables
may be viewed as the consequence of an unidentified third factor.

Scales are next established on the axes in such a manner as to accom- 



Table 10.1.7  Ranking of Composers by Designated Groups

Rank   Boston Symphony     Musicologists     Musicologists
       Orchestra           (1944)            (1951)
  1.   Beethoven           Bach              Beethoven
  2.   Brahms              Beethoven         Bach
  3.   Wagner              Mozart            Brahms
  4.   Mozart              Wagner            Haydn
  5.   R. Strauss          Haydn             Mozart
  6.   Bach                Brahms            Debussy
  7.   Sibelius            Handel            Schubert
  8.   Tchaikowsky         Schubert          Handel
  9.   Debussy             Debussy           Wagner
 10.   Haydn               Tchaikowsky       R. Strauss
 11.   Schubert            R. Strauss        Tchaikowsky
 12.   Handel              Sibelius          Sibelius

Source: Paul R. Farnsworth, The Social Psychology of Music, Holt, Rinehart and Winston,
Inc., New York, 1958, p. 168 (adapted).


Table 10.1.8  Final Standings of Major League Teams: Games Won,
Runs Scored, and Runs Yielded, 1959















Figure 10.2.1  Scatter Diagram, Suicide Rate by Annual Per Capita Income,
Central U.S., 1929-1950 (horizontal axis: per capita income, X)


tending perpendicularly from Y = 15.2 and X = 602; similarly, 1950 is
represented by a point at the intersection of 11.9 and 947. The swarm
of all such points constitutes the scatter diagram.


Types of Scatter. It is the pattern of this swarm that enables us to judge
the nature of the relationship, and such a judgment is an essential
preliminary to its proper measurement. Thus, it appears that a fixed
increase in income is accompanied by a fixed decrease in the suicide rate;
that is, the suicide rate changes by a constant amount per unit income.
Such a relation is termed rectilinear, or simply linear, because the trend of
scatter conforms to the track of a straight line.

Any such trend line, whether freehand or mathematically fitted, is
technically termed a line of regression. This concept was coined by
Galton in 1877, who used it in connection with his correlational studies
of the characteristics of parents and their offspring. He perceived that
such a line effectively expressed the tendency of children to regress
to the average level of the parents in a wide variety of traits. The term
has survived and enjoys a wide currency, although it is no longer
restricted to its original connotation.




Table 10.2.1  Suicide Rate by Annual Per Capita Income, Central U.S., 1929-1950

Year     Per Capita Income (X)     Suicide Rate (Y)
1929            604                     15.4
1930            532                     17.9
1931            486                     19.1
1932            399                     19.3
1933            400                     18.1
1934            437                     16.9
1935            482                     16.5
1936            549                     15.2
1937            567                     16.3
1938            515                     16.7
1939            566                     15.3
1940            602                     16.5
1941            708                     14.0
1942            800                     12.8
1943            914                     11.0
1944            972                     11.0
1945            985                     12.3
1946            951                     12.3
1947            897                     12.1
1948            92$                     1141
1949            878                     11.9
1950            947                     11.9

Source: Donald Ftlfle, Suicide in Relation to Income, Urbanization and Race,
Unpublished Master's Thesis, Department of Sociology, Indiana University, 195f,
Tables


modate, with a margin to spare, the observed ranges of the respective
variables. Thus, the horizontal scale covers the distance from $200 to
$1,200, while the vertical scale extends from 10 to 20. Needless to say,
enough markers are set up on each axis to ensure accurate and effortless
plotting. Unlike the histogram, the vertical scale in a bivariate plot need
not begin with zero, for the reason that the focus of attention is on the
contour of the scatter rather than on relative frequency as gauged by
the height of the curve. In this respect, the scatter diagram is analogous
to the semi-log time chart, whose meaning likewise inheres in its shape
rather than location.

Having drawn and scaled the axes, we are ready to plot each pair
of values by a double-duty point. For any pair of variates, the Y-value
fixes the height of the point above the base line and the X-value fixes
its horizontal distance from the vertical axis. Thus, 1940 is represented
in Figure 10.2.1 by a point located at the intersection of guide lines ex-



Figure 10.2.3  Scatter Diagram, Intensity of Feeling by
Favorableness of Attitude (Hypothetical Data)


The three scatter diagrams presented thus far have distinctive trends
and one would have little hesitancy in describing them as exclusively
linear or curvilinear. But scatters of empirical observations are seldom
so clean and unambiguous; more often both linear and curvilinear
tendencies combine in the same set of data and thereby complicate the
problem of representing correlation by an over-all measure. For example,
the scatter of delinquency rates and average monthly rentals for
small census tracts in the city of Chicago (Figure 10.2.4) displays
some linearity, yet it appears that a curved trend line would best fit
the entire scatter. From left to right, as rentals increase, the delinquency
rate responds by decreasing, but at a progressively slower rate.
This is evidenced by the straightening of the swarm. There are of course
several striking exceptions to the foregoing generalization: a few extremely
high delinquency rates occur with above-average rentals. Such
mavericks would require special analysis since they represent a
breakdown in the "law." Yet, in the main, the law of relationship holds fairly
well, affording a measure of predictability of one variable from the other.
If we knew, for example, the rental to be $60, we would predict the delinquency
rate to be approximately 3.0, which is the height of the freehand
regression curve at that point. To be sure, such a prediction would not


Furthermore, the relation in Figure 10.2.1 is said to be inverse, since
the two series move in opposite directions: as income rises, the suicide
rate falls. If suicide and income had risen together, then the trend of
points would have been upward, reading from left to right. Such a
sloping trend line (Figure 10.2.2) is evidence of a direct linear relationship
between two variables; family size increases as farm acreage increases.

Figure 10.2.2  Family Size by Farm Acreage (horizontal axis: farm acreage, X,
10 to 80)
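
The distinction between direct and inverse relations can be sketched numerically: when the sum of cross-products of deviations from the two means is positive, the series move together; when it is negative, they move in opposite directions. The sketch below is offered only as an illustration, not as the text's own procedure. The income and suicide values are five of the rows of Table 10.2.1 (1929, 1932, 1936, 1943, 1950); the farm figures are hypothetical, built to gain one person per ten acres, and the function name is likewise an assumption.

```python
def direction(xs, ys):
    """Classify a relation as 'direct', 'inverse', or 'none' from the sign
    of the summed cross-products of deviations from the means."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cross = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if cross > 0:
        return "direct"
    if cross < 0:
        return "inverse"
    return "none"


# Five rows taken from Table 10.2.1 (1929, 1932, 1936, 1943, 1950).
INCOME = [604, 399, 549, 914, 947]
SUICIDE_RATE = [15.4, 19.3, 15.2, 11.0, 11.9]

# Hypothetical farm data: one additional person per ten acres.
ACREAGE = [10, 20, 30, 40]
FAMILY_SIZE = [4, 5, 6, 7]

if __name__ == "__main__":
    print(direction(INCOME, SUICIDE_RATE))
    print(direction(ACREAGE, FAMILY_SIZE))
```

The sign tells only the direction of the trend; it says nothing about whether the trend is linear or how closely the points follow it, which is why the scatter diagram is still needed.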

The trend of the scatter will not always be linear; rather it may be
curvilinear and take on any one of innumerable curve patterns. A simple
example is provided in Figure 10.2.3, which portrays the relation between
favorableness of attitude toward a minority people and intensity of
feeling. As might be anticipated, a decided opinion, whether pro or
con, is held with considerable intensity of feeling, while a less decided
or neutral opinion arouses no very strong feelings. Whether this pattern
(strong opinion, strong affect; weak opinion, weak affect) holds under
all conditions is not here our concern. That would be a matter for empirical
investigation. Here we are merely interested in exhibiting a type
of relationship which is fairly common in sociological studies.






Figure 10.2.4  Scatter Diagram and Freehand Regression Line, Delinquency
Rate by Monthly Rental, 140 Local Areas, Chicago, 19S0


be free of error, for the obvious reason that none of the observed values
fall right on the curve at that point; all deviate to a greater or lesser
extent. Evidently, the accuracy of any such prediction would vary
according to the tendency of the points to hug the line of relationship
between the two series. When the points move within a narrow lane,
predictive accuracy, and therefore correlation, would be high; when the
points are widely scattered, predictive accuracy would be correspondingly
low. Only when all points fall right on the regression line would
prediction and correlation be perfect. At the other extreme, when the
scatter is purely random, then we may just as well ignore the so-called
"predictor variable." For any or all rentals, our best guess would be
the over-all mean of the delinquency rates.

Scedasticity. Knowing the rental to be 8^, we still could not accurately
forecast the level of delinquency, since the delinquency values are widely
scattered for that value; on the other hand, for a rental of $70, we could
predict with much greater accuracy, owing to the bunching of delinquency
rates around the regression line. This scatter of Y-values for the respective
X-values is known as scedasticity. If the degree of variation in
delinquency rates (the width of the scatter band) had been uniform
for all values of X, then we could have spoken of Y as being homoscedastic
in respect to X. Actually, the degree of scatter in Y diminishes
as X changes, so that Y is heteroscedastic in respect to X.
Heteroscedasticity implies that the degree of correlation is not uniform throughout
the entire series; hence, its presence reduces the feasibility of a single
over-all measure of correlation, which is after all an average. Just as
we hesitate to compute the mean of heterogeneous bimodal data, similarly
we hesitate to calculate an average measure of correlation of a
heteroscedastic scatter.

Heteroscedasticity may be even more glaring than that exhibited in
Figure 10.2.4; the scatter may be gourd-shaped, dumbbell-shaped, or
J-shaped, as in Figure 10.2.5. These oddly shaped scatter diagrams by
no means exhaust the variety of types that may be encountered in practical
work. However, they do serve to ratify the utility of this visual
aid. Although the scatter diagram yields no mathematical measure of
correlation, it does indicate (a) whether the relationship is simply rectilinear
or more complex, (b) whether or not the relationship is consistent
over the entire range, and (c) whether the relation is strong or weak.
It is an indispensable tool and plays the same role in correlation as does
the frequency graph in the processing of univariate data. It provides
a bird's-eye view of the whole distribution.

Figure 10.2.5  Selected Types of Scatter (panels: Gourd-Scatter,
Dumbbell-Scatter, J-Scatter)

The Joint Frequency Table. Instead of plotting the individual items of
the bivariate data, we may group them. Such grouping answers the
general purposes of all grouping: (1) to reveal the basic pattern of
distribution, and (2) to facilitate the statistical processing of the data. In
grouping bivariate data, we classify each case simultaneously in two class
intervals, thereby locating each case at the intersection of a given row
and given column. Hence, it is as though we superimposed a grid on
the scatter diagram, counted the points in each cell and inserted the
corresponding number. Such an operation, applied to the scatter of
delinquency rates and average rentals (Figure 10.2.6), would yield the
joint frequencies of Table 10.2.2:
















Figure 10.2.6  Grid on Scatter Diagram, Delinquency Rate by Average
Monthly Rental (horizontal axis: average monthly rental in dollars, X, 0 to 100)


Naturally, in any real situation we would not proceed in this somewhat
fanciful manner; rather, we would immediately tally the unarrayed
bivariate items in the grid set up for that purpose, and then count the
joint occurrences in each cell.




Table 10.2.2  Joint Frequency Table: Delinquency Rates by Average
Monthly Rentals

Delinquency              Average Monthly Rental in Dollars (X)                    Frequency
Rate (Y)         0-9  10-19  20-29  30-39  40-49  50-59  60-69  70-79  80-89  90-99   (f_y)
18.0-19.9          -      -      -      1      1      -      -      -      -      -      2
16.0-17.9          -      -      -      -      1      -      -      -      -      -      1
14.0-15.9          -      -      -      1      -      1      -      -      -      -      2
12.0-13.9          -      -      4      -      -      -      -      -      -      -      4
10.0-11.9          -      1      1      -      1      -      -      -      -      -      3
 8.0- 9.9          -      1      2      1      1      -      1      -      -      -      6
 6.0- 7.9          -      2      1      1      1      -      -      -      -      -      5
 4.0- 5.9          -      -      5      4      6      3      4      1      -      -     23
 2.0- 3.9          -      -      1      4     14     11     12      9      1      -     52
 0.0- 1.9          -      -      -      -      -      7     13     13      7      2     42
Frequency (f_x)    0      4     14     12     25     22     30     23      8      2    140



Grouping Procedure. To clarify this grouping procedure, we list the
steps observed in the construction of Table 10.2.2.

1. Select a suitable class interval for each variable.

2. Mark off these class intervals on properly drawn axes, and from
these markers extend guide lines in such fashion as to create a gridwork
of cells. Each cell is taken to represent the junction of two class intervals.

3. Locate each pair of values in its proper cell and indicate its presence
there by a tally mark.

4. Count tally marks in each cell and replace tally by that number.
Any such number represents a joint frequency, i.e., the frequency
with which two class values, or midpoints, occur together.

5. Sum by rows and columns in order to obtain the marginal totals.
These marginal totals of course constitute the simple frequency
distributions of the individual variables.

6. Sum marginal frequencies to obtain the grand total of cases, N, the
one sum serving as a check on the other.
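
The six steps above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the procedure used to build Table 10.2.2: the pairs in `PAIRS` are hypothetical (rental in dollars, delinquency rate), the class widths of 10 and 2.0 merely echo those of the table, and the function and variable names are invented for the example.

```python
from collections import Counter


def joint_frequency(pairs, x_width, y_width, x_start=0.0, y_start=0.0):
    """Tally (x, y) pairs into cells and return the cell counts together
    with the row (Y) and column (X) marginal totals.

    Cells are keyed as (row_index, col_index), counting class intervals
    upward from x_start and y_start."""
    cells = Counter()
    for x, y in pairs:
        col = int((x - x_start) // x_width)   # class interval of X
        row = int((y - y_start) // y_width)   # class interval of Y
        cells[(row, col)] += 1                # one tally mark per case

    row_totals = Counter()
    col_totals = Counter()
    for (row, col), f in cells.items():
        row_totals[row] += f
        col_totals[col] += f
    return cells, row_totals, col_totals


# Hypothetical cases: (average monthly rental, delinquency rate).
PAIRS = [(62, 1.5), (65, 0.4), (48, 2.2), (33, 5.0), (67, 1.9), (41, 3.1)]

if __name__ == "__main__":
    cells, rows, cols = joint_frequency(PAIRS, x_width=10, y_width=2.0)
    # Step 6: both sets of marginals must sum to the same grand total N.
    assert sum(rows.values()) == sum(cols.values()) == len(PAIRS)
    for key in sorted(cells):
        print(key, cells[key])
```

Keeping the cells in a sparse counter, rather than a fixed grid, makes it easy to retry the tally with finer or coarser class intervals.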

In retrospect, we see that the construction of the joint frequency table
proceeds according to well-established principles: the classification should
be fine enough to reveal the shape of the joint frequency distribution,
but not so fine that some rows and columns are completely vacant. But
if one is to err at the outset, it is perhaps preferable to have too many
cells than too few. The optimum arrangement will be more apparent
in the light of too much detail rather than too little, and it is usually
easier to amalgamate cell frequencies than to partition them.

Function of the Joint Frequency Table. We must acknowledge that a
joint frequency table cannot depict the type of relationship so vividly
as the scattergram, for the reason that numerals cannot convey gradations
of density so effectively as does the scatter of dots. Nevertheless, in
spite of its relative coarseness, it will often be more than adequate for
purposes of exhibiting the pattern of relationship and assuring its
comprehension.

Moreover, the column and row distributions of the joint frequency
table permit a statistical measurement of scedasticity which would be
impossible to obtain from a raw scatter diagram. For such an assessment,
we need only compute the standard deviation of each column or row.
A close similarity among such standard deviations would be evidence of
homoscedasticity, whereas pronounced differences would suggest
heteroscedasticity.
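
As a sketch of this assessment, the standard deviation of delinquency rates can be computed within each rental column of Table 10.2.2. The cell frequencies below are read from that table; two things are assumptions made for the illustration: each class of delinquency rates is represented by a rounded midpoint (1, 3, ..., 19), and the function and variable names are invented.

```python
from math import sqrt

# Assumed midpoints of the delinquency-rate classes, top row (18.0-19.9) first.
Y_MIDPOINTS = [19, 17, 15, 13, 11, 9, 7, 5, 3, 1]

# Cell frequencies of Table 10.2.2; columns are rentals 0-9, 10-19, ..., 90-99.
TABLE = [
    [0, 0, 0, 1, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 1, 0, 0, 0, 0],
    [0, 0, 4, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0, 0, 0, 0, 0],
    [0, 1, 2, 1, 1, 0, 1, 0, 0, 0],
    [0, 2, 1, 1, 1, 0, 0, 0, 0, 0],
    [0, 0, 5, 4, 6, 3, 4, 1, 0, 0],
    [0, 0, 1, 4, 14, 11, 12, 9, 1, 0],
    [0, 0, 0, 0, 0, 7, 13, 13, 7, 2],
]


def column_sds(table, y_mid):
    """Standard deviation of Y within each X column (None if the column is empty)."""
    sds = []
    for column in zip(*table):
        n = sum(column)
        if n == 0:
            sds.append(None)
            continue
        mean = sum(f * y for f, y in zip(column, y_mid)) / n
        variance = sum(f * (y - mean) ** 2 for f, y in zip(column, y_mid)) / n
        sds.append(sqrt(variance))
    return sds


if __name__ == "__main__":
    for lower, sd in zip(range(0, 100, 10), column_sds(TABLE, Y_MIDPOINTS)):
        label = f"{lower}-{lower + 9}"
        print(label, "empty" if sd is None else round(sd, 2))
```

On these assumptions the column for rentals of $20-29 has a standard deviation of roughly 3.6, while the column for $70-79 has one of roughly 1.2, which agrees with the text's description of the scatter as heteroscedastic. Columns with very few cases give unstable values and should be read with caution.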

Furthermore, the marginal distributions of the X- and Y-variables
(which, of course, are integral features of the joint frequency table)
also supply important and recognizable clues to the possible degree of
association between the plotted variables. Specifically, the marginal
distributions set limits to the degree of obtainable correlation. For
example, unlike marginals preclude a perfect linear relationship. This
merely generalizes the earlier observation in respect to the φ-coefficient
that perfect two-way association in contingency tables is impossible so
long as marginal sets are not identical. This general principle is
illustrated by Table 10.2.3, whose marginals make it impossible to locate
all cases along one diagonal, as would be required in perfect rectilinear
correlation. No matter how we deploy the cases, we cannot force them
onto one diagonal, as the student should experimentally demonstrate for
himself. In general, the marginals always act to constrain in one way
or another the type and degree of association between paired variables;
hence, it is always necessary to examine them in order to determine the
limits which they impose, and the prospects for the valid employment
of a given correlation index.

Table 10.2.3  Unlike Marginal Sets

                       X-variate
                                        f_y
Y-variate                                10
                                         10
                                         10
f_x              15      10       5      30
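
The experiment suggested to the student can also be sketched in code. The reasoning assumed here is that a diagonal cell can never hold more cases than the smaller of its row total and its column total, so the greatest number of cases that can be placed on a diagonal is the sum of those minima. The marginals are those of Table 10.2.3; the function names are hypothetical.

```python
def max_on_diagonal(f_x, f_y):
    """Largest number of cases that can lie on one diagonal of a square
    joint frequency table, given its column (f_x) and row (f_y) totals.

    Both diagonals are tried: the main diagonal pairs the marginals in the
    same order (one direction of relation), the other pairs them in reverse
    order (the opposite direction)."""
    main = sum(min(r, c) for r, c in zip(f_y, f_x))
    other = sum(min(r, c) for r, c in zip(f_y, reversed(f_x)))
    return max(main, other)


def perfect_diagonal_possible(f_x, f_y):
    """True only if every case can be located on a single diagonal."""
    return max_on_diagonal(f_x, f_y) == sum(f_x)


# Marginal totals of Table 10.2.3.
F_X = [15, 10, 5]
F_Y = [10, 10, 10]

if __name__ == "__main__":
    print(max_on_diagonal(F_X, F_Y), "of", sum(F_X), "cases")
    print(perfect_diagonal_possible(F_X, F_Y))
```

With these marginals at most 25 of the 30 cases can be brought onto a diagonal, so at least five must fall elsewhere, whatever the arrangement.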

Additionally, the joint frequency table may serve as a computing aid
which may be called on especially when the number of instances is large
or when computing machines are not available. Finally, it is a
prerequisite to the simplest approach to curvilinear correlation, as will be
evident in a later section.

This section has stressed the important role of the scatter diagram and
the joint frequency table in the preliminary analysis of covariation.
From these charts it is possible to determine whether the relationship
is (a) linear or curvilinear; (b) direct or inverse; (c) weak or strong;
additionally, (d) whether the variables are homoscedastic in respect to each
other; (e) whether there are notable exceptions to the main trend or
"law of relation," and (f) whether the marginal distributions are
symmetrical or skewed, and approximately matched. Because of these
significant disclosures, it is mandatory that, prior to the measurement
of correlation, a scatter diagram or joint frequency table, or even both,
be constructed and carefully studied. Without these visual aids, we fall
into the danger of using improper procedures and thereby arriving at
misleading conclusions.

Questions and Problems

1. Define the following concepts:

   Scatter Diagram
   Bivariate Data
   Rectilinearity
   Scedasticity
   Curvilinearity
   Joint Frequency Table
   Marginal Distributions

2. If the marginal distributions are normal in shape, will the scatter necessarily
be homoscedastic? Illustrate and explain.

3. Is a perfect rectilinear scatter possible, if marginal distributions are unlike?
Illustrate your answer by table or graph.

4. Does the presence of homoscedasticity in a scatter guarantee linearity?
Illustrate by sketch.

5. Does perfect rectilinearity in a scatter require identical marginals? Illustrate.



Table 10.2.4  Clothing Expenditures by Total Family Expenditures, U.S.

Total Expenditures           Clothing Expenditures
(Hundreds of Dollars)        (Per Cent of Total)
(X)                          (Y)
     y                            3
    10                            8
    14                           12
    18                           13
    20                           14
    24                           15
    23                           16
    34                           18
    43                           20


Table 10.2.5  Per Cent Democratic Vote by Voter Registration, Selected States,
Congressional Elections, ISSt

State      Democratic Vote      Per Cent
           (% of Total)         Registered
Am.             53                 78
Ark.            85                 50
Cal.            44                 88
Colo.           44                 91
Conn.           46                 91
m.              74                 75
Ga.            100                 59
Idaho           41                 96
Ind.            43                 96
La.             91                 67
Md.             48                 62
Mass.           46                 88
Mont.           43                 83
Nct.            50                 81
N.H.            37                 93
N.J.            42                 85
N.Y.            40                 80
Ohio            44                 59
Ore.            39                 87
Pa.             48                 78
R.I.            U                  87
S.C.            63                 49
S.D.            31                 83
Vt.             28                 83
Va.             67                 33
Wash.           43                 92

Source: U.S. Bureau of the Census, Statistical Abstract of the U.S., 1952,
U.S. Government Printing Office, Washington, D.C., 1952.



6. If a scatter is homoscedastic, does it follow that the relationship is close?

7. (a) Construct a scatter diagram for the data in Table 10.2.4.

(b) Formulate in your own words the "law of relation."

8. (a) Plot a scatter diagram for the data of Table 10.2.5.

(b) Characterize the relationship. Does a preponderant one-party vote appear
to depress voter registration?

9. Construct the scatter diagram for the delinquency and recidivism rates
(Table 10.2.6). How would this trend be described in words?


Table 10.2.6  Delinquency and Recidivism Rates, Chicago, 19i$

Delinquency        Recidivism
Rate per 100       Rate
     0.5             26.1
     1.5             26.6
     2.5             35.1
     3.5             30.1
     4.5             38.7
     5.5             54.8
     6.5             42.9
     7.5             64.7
     8.5             32.4
     9.5             52.3
    10.5             66.7
    11.5             50.2
    12.5             59.9
    13.5             65.9
    15.5             53.6
    16.5             80.3
    19.5             61.1

Source: Clifford R. Shaw, Delinquency Areas, The University of Chicago Press,
1929, Table 21, p. 131.


10. (a) Plot a scatter diagram from Table 10.2.7.

(b) What does the scatter of this graph suggest concerning the "pull" of
economic conditions? Explain the two fairly distinct clusters.

11. Construct a joint frequency table for the data in Table 10.2.7. Divide Income
Index into ten class intervals of width 10, lower rounded limit 50; divide
Per Cent Born in Other States into ten class intervals of width 10, lower limit 0.
Tally marginal frequencies.

12. Sketch the marginal distributions which would yield the scatters shown in
Figure 10.2.7.




Table 10.2.7  Per Cent Born in Other States by Income Index, U.S.






§ 10.3 Linear Correlation of Two Variables


Section Three

Linear Correlation of Two Variables

The Need for an Over-All Measure of Correlation. The scatter diagram,
which was elaborated in the preceding section, reveals to the eye whether
two variables change together in a systematic manner. The cluster of
plotted points around the hypothetical regression line suggests the "law
of relation" between the two variables. Moreover, by observing the
width of the scatter, we can make at least a preliminary judgment as
to how well the cases conform to this hypothetical law. Such conclusions
are often of considerable value; yet they still leave something to be
desired, for they are impressionistic, subjective, and unstandardized.
Consequently, they cannot be described or communicated to anyone else
without reproducing the chart, or at least engaging in an extended
account of it. This is, of course, impractical. And since these
"eye-measures" are not accurate in a mathematical sense, comparisons with
similar measures are impossible even if all the intricate charts are
available. What we need therefore is an objective, standard, synoptic measure
of the relation between the two variables. This will finally be the measure
of correlation.

In principle, there is of course no difference between rough measures
made by the eye and those computed by a mathematical operation. The
latter are merely more precise and serviceable. What we see is a
hypothetical trend line to which the swarm of points more or less conforms
and from which we deduce the degree of correlation. But in order to
measure this correlation we must (1) establish exactly the position of
such a line, (2) measure the amount of conformity thereto, and (3) translate
this result by a prescribed method into an index of correlation. Our
first concern, then, would be with the location of the line which best
fits the observed data.

As we have previously learned, a relation may be rectilinear or
curvilinear; hence, the best-fitting line may be straight or curved. In this
section, we restrict ourselves to those relations which are rectilinear, or
approximately so, and which can therefore be legitimately represented by
a straight line. In other words, our concern will be with those relations
in which a constant change in one variable produces, on the average, a
constant change in the other paired variable. Such a relation has already
been illustrated in Figure 10.2.2, which plots the size of family and the
size of farm in China; family size increases by one person on the average
as farm size increases by ten acres.

Here, for purposes of exposition, let us use some very simple data
which exemplify the relation between income and social status score
(Table 10.3.1a). It is reasonable to designate social status as the dependent
Y-variable, since status appears to adjust itself to income in the
long run, rather than the reverse. Once cases have been plotted to form
a scatter diagram (Figure 10.3.1), we wish to draw the most representative
straight line.


Table 10.3.1a  Income and Social Status Score, Selected Families

   Income in Thousands        Social Status
   of Dollars (X)             Score (Y)

           3                       3
           7                       5
          11                       7
          14                       6
          15                       9

   $\bar{X} = 10$             $\bar{Y} = 6$

Source: Suggested by W. Lloyd Warner's studies in social class.


Figure 10.3.1  Freehand Trend Line on Scatter Diagram,
Income and Social Status
(horizontal axis: Income in Thousands of Dollars, X)



§ 10.3 Linear Correlation of Two Variables

Freehand Regression Line. It would of course be possible to draw
freehand what appears by inspection to be the best-fitting trend line. If
we had drawn such a line and were then asked to defend its location, we
would spontaneously reply that we had drawn it "right through the
middle"; that is, as close to all the points on the average as possible,
cutting the swarm lengthwise into two equal bands. It was drawn down
the middle because we intuitively felt that this line was the best average
for all of the observed points. It thus serves as a running estimate of
the Y-values, which takes into account every observed value of X. If
we had been asked to guess a person's social status score from his income,
say $10,000, we would not guess a Y-value outside, or even close to the
edge, of the region of scatter, but rather a central value within the swarm
above the designated X-value. This would be a status score of 6. Such
estimated values of Y correspond to the observed X's and constitute the
aforementioned trend line. Naturally, the more closely the observed
points hug the line of estimation (the less they digress from it), the
smaller the error in prediction of Y from X. If income were the sole
determinant of social status, the observed points would necessarily all be
on the line, without error; if social status alone determined income, the
points would again fall on the line without residue.

Now, what is the significance for correlation of these departures from
the hypothetical regression line? To the statistical analyst it indicates
that there are other factors besides income which determine social status.
The operation of these other, unknown factors disturbs our prediction.
Hence, they reduce the correlation between the two given variables.

Values Expressed as Deviations from Mean. In Table 10.3.1a and the
scatter diagram derived from it, the variates are given in their original
form. But for several reasons, not fully amplified here, it is more
meaningful to compare the variables after they have been transformed into
deviations from their respective means, $\bar{X}$ and $\bar{Y}$.

When comparing values, as we do in correlation, the student must
accustom himself to thinking in terms of deviations from the mean instead
of the raw data. Like so many other statistical procedures which may at
first seem needlessly indirect and complex, this too conforms to common
sense. We do not ordinarily compare raw test scores, salaries, birth rates;
but rather consider them as "high" or "low"; that is, above or below
the average or norm to which we are accustomed. Still less can we
compare units of different categories, such as incomes and birth rates, in
their raw form; we associate income below average with family size that
exceeds the average. Thus, we see that the mean is the appropriate point
of origin, the natural standard, whenever we compare two sets of
data in respect to their affinity with each other.

The items in Table 10.3.1b are therefore expressed as deviations from
their respective means, and then plotted in that form (Figure 10.3.2a).
In graphing deviations, we must of course be able to accommodate
negative values, and consequently we use four quadrants, corresponding to
the four combinations of signs.


Table 10.3.1b  Income and Social Status Scores
as Deviations from $\bar{X}$ and $\bar{Y}$

      Income        Social Status
       (x)              (y)

       -7               -3
       -3               -1
        1                1
        4                0
        5                3

   Σ    0                0



It should be observed that the configuration of plotted deviations is
identical with the scatter of original values; only the scale markers
have been changed. All we did was to shift the zero origin to the
intersection of the means.

Explained Deviations. The freehand trend line, drawn straight through
the middle, is obviously an average for all of the observed points, a
continuous succession of norms, as it were, adjusted to the entire series
of cases. Since it consists of the deviations which one would expect on
the average, the values on the sloping trend line are variously called the
"expected," "predicted," or "estimated" values. The expected deviations
are graphically plotted (Figure 10.3.2b) by vertical lines extending






explained; that is, they must be attributed not to income, but to unknown
factors on the identity of which we may only speculate, since additional
data are not available in the problem as set up. Hence, they are called
unexplained residuals, measured by the respective distances from the
trend line to the individual observed points in the swarm.

From inspection of the foregoing charts, we must conclude that the
closer the correspondence between the explained and observed deviations
(i.e., the smaller the residuals) the higher the degree of correlation.
It would therefore be logical to measure the degree of correlation
according to the degree of correspondence between the estimated and
observed deviations; or by the proportion of the aggregate observed
deviation that is explained. This is the fundamental principle of the
measurement of correlation and is always implicit in its calculation. To
be sure, the measurement is not carried out informally by freehand
regression lines or by graphic readings obtained with a ruler or a pair of
dividers; still, it remains essentially a comparison, or ratio, between
the explained and total deviations.

Since these concepts play such an important part in the subsequent
presentation, it will be well to summarize their meaning precisely:

(1) Total (Observed) Deviation: the deviation of an observed value
from the mean of the series.

(2) Explained Deviation: the deviation of an expected (regression)
value from the mean of the series.

(3) Unexplained Residual: the discrepancy between the total and the
explained deviation. This is, of course, the difference between
(1) and (2) above.

The Measurement of Linear Correlation

A serious disadvantage of freehand graphs is that they rest on personal 
judgment. Without a standard operating procedure, it is unlikely that 
two workers would ever locate the trend line in exactly the same position. 
Obviously, a line that is used to measure correlation in the manner sug- 
gested above must always meet the same specifications; otherwise the 
results will be lacking in the reliability which is essential to scientific 
procedure. 

Such standard specifications have been formulated as follows: the
line is so located as to make the sum of the vertical residuals around that
line equal to zero; that is, to make the sum of the positive and negative
discrepancies balance one another. Thus, such a line represents the
scatter around it, as the mean represents its array. And like the mean,
it makes the sum of the squared deviations a minimum: it conforms
to the principle of least squares. Because of these properties it is
labeled the line of best fit; by the criterion of least squares, it fits
the scatter better than does any other straight line.

The tracing of this mathematical line of best fit is simplified by again
plotting cases as deviations from means. For, under these conditions,
the line of least squares will always conveniently pass through the zero
origin, which lies at the intersection of the means. Consequently, it
may be readily plotted once its slope on the x-axis has been determined.

Computation of Slope. The calculation of this slope is according to the
following equation, presented here without explanation:

$$b_{yx} = \frac{\sum xy}{\sum x^2} \tag{10.3.1}$$

where $b_{yx}$ = slope of line of expected y's on observed x's
                 (read: "b, y on x")

      $\sum xy$ = sum of products of paired x- and y-deviations
                  (read: "sum of cross-products")

This ratio is the slope of the sought-for best-fitting line; it is the
average change in Y per unit change in X. Accordingly, to find this average
change in social status per unit change in income, we compute the
cross-products and squared x-deviations, and form the ratio between their
respective sums. Performing these operations (Table 10.3.2), we obtain


Table 10.3.2  Calculation of $b_{yx}$

      X      Y      x      y     $x^2$    $xy$

      3      3     -7     -3      49      21
      7      5     -3     -1       9       3
     11      7      1      1       1       1
     14      6      4      0      16       0
     15      9      5      3      25      15

  Σ  50     30      0      0     100      40

$$b_{yx} = \frac{\sum xy}{\sum x^2} = \frac{40}{100} = .4$$



$b_{yx} = .4$. This is the average rate of increase in social status per
unit of income; that is, for every $1,000 increase in income, there is an
average increase of .4 status points. Because this average fixes the
inclination of the regression line to the axis of the independent
variable, it has come to be known as slope. Once determined, it enables us
to plot the line of least squares and proceed to the precise measurement
of correlation.
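
The slope computation of Table 10.3.2 can be retraced mechanically. What follows is a minimal sketch in Python, not part of the original text; the function names `deviations` and `slope` are illustrative assumptions, and the data are the five families of Table 10.3.1a.

```python
# Data from Table 10.3.1a: income in thousands of dollars (X) and
# social status score (Y) for the five selected families.
X = [3, 7, 11, 14, 15]
Y = [3, 5, 7, 6, 9]


def deviations(values):
    """Express each value as a deviation from the mean of its series."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def slope(xs, ys):
    """Slope of ys on xs: sum of cross-products over sum of squared x-deviations."""
    x = deviations(xs)
    y = deviations(ys)
    sum_xy = sum(a * b for a, b in zip(x, y))  # sum of cross-products
    sum_x2 = sum(a * a for a in x)             # sum of squared x-deviations
    return sum_xy / sum_x2


b_yx = slope(X, Y)
print(b_yx)  # 0.4
```

The two intermediate sums are 40 and 100, the column totals of the worktable, so the ratio reproduces the slope of .4 status points per thousand dollars.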

Plotting the Regression Line. Since a straight line is determined by any
two points, we plot two points known to lie on the least-squares line and
run a straight line through them. For one of these, we pick up the point
of origin (intersection of axes); the other point is most conveniently
obtained by calculating the expected value of y when x = 1.00. This
will of course be equal to $b_{yx}$. Accordingly, we draw a line through
the zero origin and a point plotted at the intersection of 1.00 and .4.
This line is plotted in Figure 10.3.3.

It is this line that supplies the explained deviations ($\hat{y}$'s)
corresponding to the observed x's, since the unbroken line of best fit
necessarily consists of all possible explained deviations. From it, we may
read the approximate magnitudes of the explained deviations; but they may
be more precisely determined by application of the slope formula:

$$\hat{y} = b_{yx}\,x \tag{10.3.2}$$

where $\hat{y}$ = expected deviation in Y for any given x (read
"y-circumflex"). Substituting the observed x-deviations in the formula,
and solving for the $\hat{y}$'s, we obtain the results shown in
Table 10.3.3. These $\hat{y}$'s are the y's that would have been observed
if X and Y were perfectly correlated. But these $\hat{y}$'s were not
observed; hence, we must determine how close they come to those observed,
in order to fix the degree of correlation.

Table 10.3.3  Complete Set of $\hat{y}$-Values for Observed x's

       x        $\hat{y} = b_{yx}x$

      -7            -2.8
      -3            -1.2
       1              .4
       4             1.6
       5             2.0


Coefficient of Determination. We have thus come to the final stage in the
measurement of correlation. This measurement proceeds on the principle
that the more nearly the explained deviations approach the total
deviations, obviously the greater is the proportion of variation explained
and the higher the degree of correlation. As a statistical operation, the
conversion to a single index consists of summing the squared explained
deviations, and expressing this explained variation as a proportion of the
total variation to be explained. Symbolically,

$$r^2 = \frac{\text{Explained Variation}}{\text{Total Variation}} = \frac{\sum \hat{y}^2}{\sum y^2} \tag{10.3.3}$$



This ratio, be it noted, is symbolized by $r^2$. It is the fraction of the
total variation in the dependent Y-variable which is determined by the
independent X-variable; hence, it is called the Coefficient of
Determination. Since it is impossible to explain more than the total
variation, $r^2$ can never be greater than unity, and will practically
always be less than one.

Had the simple algebraic deviations around the mean been employed
as measures of variation, they would have summed to zero, a very
impractical situation which would not have permitted any ratios at all.
Hence we turn to squared deviations, in compliance with the principle
of least squares.

But in order to circumvent this more complicated computation, would
it not have been possible to take the simple arithmetic deviations from
the median? This was, in principle, the solution of Francis Galton, who
originally propagated the concepts of regression and correlation in his
famous work Natural Inheritance (1889). However, the mean has
properties not possessed by the median which make it more useful in
correlation computations. And once the mean has been chosen, there is no
alternative but to adopt the least-squares principle for its
implementation.

The complete computation of $r^2$ is illustrated in Table 10.3.4, where
the explained and observed deviations are squared and summed to give
the explained and total variation, respectively. Upon dividing the total
variation, 20, into the explained variation, 16, we obtain $r^2 = .80$.
This is the final measure of the degree of association between the two
variables: it reveals that 80 per cent of the variation in Y is accounted
for by its linear dependence on X.

Table 10.3.4  Computation of $r^2$

      x      y      b · x      $\hat{y}$    $\hat{y}^2$    $y^2$

     -7     -3    (.4)(-7)      -2.8          7.84           9
     -3     -1    (.4)(-3)      -1.2          1.44           1
      1      1    (.4)(1)         .4           .16           1
      4      0    (.4)(4)        1.6          2.56           0
      5      3    (.4)(5)        2.0          4.00           9

  Σ                              0.0         16.00          20
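
The arithmetic of Table 10.3.4 may also be checked by a short program. The following is a hedged sketch in Python, added for illustration only; the variable names are assumptions, and the deviations are those of Table 10.3.1b.

```python
# Deviations from the means, as in Table 10.3.1b.
x = [-7, -3, 1, 4, 5]   # income deviations
y = [-3, -1, 1, 0, 3]   # social status deviations

# Slope of y on x (Formula 10.3.1).
b_yx = sum(a * b for a, b in zip(x, y)) / sum(a * a for a in x)

# Explained deviations lie on the regression line (Formula 10.3.2).
y_hat = [b_yx * a for a in x]

explained_variation = sum(v * v for v in y_hat)  # about 16
total_variation = sum(v * v for v in y)          # 20
r_squared = explained_variation / total_variation

print(round(r_squared, 2))  # 0.8
```

The result is rounded before printing because the explained deviations are computed in floating point and differ from 16 in the last decimal places.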

Reversibility of $r^2$. We could have computed the regression of X on Y
as easily as the regression of Y on X and thereby determined the
proportion of X-variation explained by Y. In fact, such a result would
appear essential to a complete statement of correlation. Why compute
$r^2_{yx}$ and ignore $r^2_{xy}$? The answer is that it has not been
ignored, although we did not compute it independently. It is simply
unnecessary to fit both regression lines for the reason that
$r^2_{yx} = r^2_{xy}$. In short, $r^2$ is reversible. For example, knowing
that $r^2_{yx} = .80$, we may state not only that income explains 80 per
cent of the variation in social status but the reverse as well, namely,
that social status accounts for 80 per cent of the variation in income.
Since, from a purely statistical standpoint, each explains the other to
the same degree, the subscript is usually not attached to $r^2$.
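
The reversibility can be verified on the illustrative data by fitting both regressions. The sketch below, in Python, is an added illustration under the assumption that both series are already expressed as deviations from their means; the helper name `r_squared` is hypothetical.

```python
def r_squared(dependent, independent):
    """Proportion of variation in `dependent` explained by its linear
    regression on `independent`; both are deviations from their means."""
    sum_cross = sum(d * i for d, i in zip(dependent, independent))
    slope = sum_cross / sum(i * i for i in independent)
    explained = sum((slope * i) ** 2 for i in independent)
    total = sum(d * d for d in dependent)
    return explained / total


x = [-7, -3, 1, 4, 5]   # income deviations
y = [-3, -1, 1, 0, 3]   # social status deviations

r2_yx = r_squared(y, x)  # regression of Y on X, slope .4
r2_xy = r_squared(x, y)  # regression of X on Y, slope 2.0

print(round(r2_yx, 2), round(r2_xy, 2))  # 0.8 0.8
```

The two slopes differ (.4 and 2.0), yet the explained proportions agree, since each equals the squared sum of cross-products divided by the product of the two sums of squares.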

Coefficient of Non-determination. Since the difference between total
variation and explained variation is necessarily equal to the unexplained
variation, it follows that the unexplained proportion is simply the
difference between unity and $r^2$, a quantity appropriately termed the
Coefficient of Non-determination. We may write it as follows:

$$1 - r^2 = 1 - \frac{\text{Explained Variation}}{\text{Total Variation}}$$

We could have obtained this quantity directly by measuring the residuals
around the regression line, squaring and summing them, and expressing
this sum as a proportion of total variation. In fact, this is the
operational meaning of $1 - r^2$. But whether calculated directly from the
residuals, or indirectly via $r^2$, this coefficient may always be
construed as the proportion of variation in the dependent variable left
linearly unexplained by the independent variable. It thereby gauges the
strength of the unidentified factors.
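
The direct route through the residuals can be illustrated on the same five cases. This is a minimal sketch in Python, offered as an assumption-laden illustration rather than as the text's own procedure; the variable names are hypothetical.

```python
# Deviations from the means, as in Table 10.3.1b.
x = [-7, -3, 1, 4, 5]   # income deviations
y = [-3, -1, 1, 0, 3]   # social status deviations

b_yx = sum(a * b for a, b in zip(x, y)) / sum(a * a for a in x)

# Unexplained residual = observed deviation minus explained deviation.
residuals = [obs - b_yx * a for a, obs in zip(x, y)]

unexplained_variation = sum(e * e for e in residuals)  # about 4
total_variation = sum(v * v for v in y)                # 20
non_determination = unexplained_variation / total_variation

print(round(non_determination, 2))  # 0.2
```

The unexplained variation of about 4 and the explained variation of 16 together exhaust the total of 20, so the direct result agrees with $1 - r^2 = 1 - .80$.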

Pearsonian Product-Moment r. Conventionally, it is r rather than $r^2$
that is selected and quoted to indicate the degree of correlation between
the two variables. Why is this so, particularly since $r^2$ seems to have
all of the prerequisites for a satisfactory index of correlation: it
ranges between zero and unity; it has a simple, comprehensible meaning?
Nevertheless, unless another index is specifically named, it is assumed
that the correlation coefficient refers to Pearsonian r. How is this
convention explained?

Before formulating an answer to that question, we must first acquaint
ourselves with the distinctive properties of r. Accordingly, we return
once again to the scatter diagram and the line of best fit. Our procedure
will be exactly the same as that followed previously, excepting that the
data will now be plotted as standard deviates ($\sigma$-values), instead
of the observed deviations (Figure 10.3.4). Again the configuration of
points is left unaffected by the transformation: the sole alteration is
that the scale unit is now the standard deviation rather than the original
raw unit of measure.





Table 10.3.5  Income and Social Status Scores as Standard Deviates

         Standard Deviates

       $z_x$         $z_y$

      -1.56         -1.50
       -.67          -.50
        .22           .50
        .89           .00
       1.12          1.50

   Σ   0.00          0.00


To establish the line of best fit through this scatter of
$\sigma$-points, we first calculate its slope as before, except that now
we operate on the standard deviates instead of the raw deviations.
Consequently, the slope would now be written in standard form:

$$\frac{\sum \left(\frac{x}{\sigma_x}\right)\left(\frac{y}{\sigma_y}\right)}{\sum \left(\frac{x}{\sigma_x}\right)^2}$$

instead of:

$$\frac{\sum xy}{\sum x^2}$$

Since $x/\sigma$ is usually symbolized z, and since the squared standard
deviates sum to N, the slope formula may be simply written

$$r = \frac{\sum z_x z_y}{N} \tag{10.3.4}$$

and symbolized $r_{yx}$ instead of $b_{yx}$. This is the culmination of
our development. We are now in a position to see that the slope of the
regression line through the scatter of $\sigma$-points is identical with
the square root of explained variation: that is, $r = \sqrt{r^2}$. For the
example above (Table 10.3.4), $r = \sqrt{.80} = \pm .89$.

To compute r directly, we would proceed as in Table 10.3.6, where
paired standard deviates are multiplied together and those cross-products
are summed and averaged. The mean of the cross-products is r.

Table 10.3.6  Pearsonian r as Mean of Cross-Products of Standard Deviates

       $z_x$       $z_y$       $z_x z_y$

      -1.56       -1.50        2.340
       -.67        -.50         .335
        .22         .50         .110
        .89         .00         .000
       1.12        1.50        1.680

   Σ                           4.465

$$r = \frac{\sum z_x z_y}{N} = \frac{4.46}{5} = .89$$
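
The same mean of cross-products can be obtained without rounding the standard deviates first. The following Python sketch is an added illustration; it assumes, as the tables above imply, that the standard deviation is taken with N rather than N - 1 in the denominator, and the helper name `standard_deviates` is hypothetical.

```python
from math import sqrt

# Data from Table 10.3.1a.
X = [3, 7, 11, 14, 15]
Y = [3, 5, 7, 6, 9]


def standard_deviates(values):
    """Deviations from the mean in units of the standard deviation,
    taken here with N (not N - 1) in the denominator."""
    n = len(values)
    mean = sum(values) / n
    sigma = sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / sigma for v in values]


z_x = standard_deviates(X)
z_y = standard_deviates(Y)

# r is the arithmetic mean of the paired cross-products.
r = sum(a * b for a, b in zip(z_x, z_y)) / len(X)

print(round(r, 2))  # 0.89
```

Squaring the unrounded r returns the coefficient of determination, .80, found earlier.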

We can now understand why r was designated by Karl Pearson the
"product-moment correlation coefficient." From the composition of Formula
10.3.4 above, it is clear that r is strictly an arithmetic mean; it is a
sum divided by the number of items in that sum. Now, the terms "mean"
and "moment" are used interchangeably in mathematical statistics;
hence, Pearson labeled the mean of the standard cross-products the
product-moment correlation coefficient.

Viewed in this light, r may be construed as the mean change in Y for
every unit change in X, or the reverse, always assuming measures in
standard form, i.e., expressed in terms of sigma units. Thus, if a given
income deviates by $1\sigma$ from its mean, we expect the associated
status scores to deviate on the average from their mean by $.89\sigma$.
By virtue of this principle, we may estimate Y (in standard form) for any
given

dependent, $r^2$ measures the over-all proportion of the total variation
of one variable that is associated with, or explained by, the other.

On the other hand, r measures the dynamic aspect of this relation,
measuring the rate of change in one variable relative to the other, as
has been demonstrated in the paradigms above. Because of this
conceptual distinction, we may say that r is primarily a predictive device
to forecast, for example, the expected level of performance on one
variable from observed performance on another. As such, it is probably
more likely to be used by educators, psychologists, and others interested
in personal prediction, rather than by sociologists. On the other hand,
$r^2$ is a summarizing measure weighing the influence, or force, exerted
by one variable on the other.

Since r is slope, it manifestly must have a direction: predominantly
up or down, according to whether the variables are positively or
negatively related. It follows that the direction of the slope reflects
the type of relation, which is then symbolized by a plus or minus sign.
The synoptic measure $r^2$, however, carries no sign since it expresses a
proportion of a total variation.

It is now clear that r and $r^2$ are not interchangeable; nor should they
be cheaply derived from each other until their structural meanings are
thoroughly understood. Since r is numerically larger than $r^2$ for every
correlation short of perfect (and other than zero), it could be
deliberately or unwittingly used to exaggerate the strength of association
and thereby mislead the reader: for example, $r = .5$ may seem to
signify a reasonably strong association, but $r^2 = .25$ indicates that
only 25 per cent of the variation in either variable is accounted for by
the other. When the emphasis is on the strength of the over-all
relationship between two variables, as is frequently the case in
sociological studies, $r^2$ is the pertinent statistic. For example, the
finding that 50 per cent of the variation in area delinquency may be
attributed to the economic factor still leaves something unexplained; yet
it represents a tangible gain in understanding a phenomenon that until a
century ago was laid at the door of demons, heredity, or free will.

In sum, each measure has its very distinctive connotations and
appropriate uses. Nevertheless, r enjoys a near monopoly over $r^2$. This
may be due partly to the inertia of tradition, which will perhaps be
dissipated when workers become more sensitive to the nuances of
quantitative reasoning.

Computing Formulas: Ungrouped Data. The extended analytic operations
previously given were designed to expose the logic and intent of
the concepts of r and $r^2$, so that they could be discriminatingly
applied in social research. But such elaborate calculations need not be
carried out if the sole object is to determine the mere numerical value of
r or $r^2$ in a given practical problem. There are handbook working
procedures


value of X, exactly as we applied $b_{yx}$ to obtain the expected
y-deviations. We have only to apply the formula:

$$\hat{z}_y = r(z_x) \tag{10.3.5}$$

where $\hat{z}_y$ = estimated $\sigma$-value in Y
      $z_x$ = observed $\sigma$-value in X

Substituting our data in this formula, we get the estimated values shown
in Table 10.3.7. Thus, when income is 1.12 in standard form, social
status is estimated to be 1.00; when income is $-1.56\sigma$, social
status would be $-1.39\sigma$. Perfect correlation between two series
would prevail whenever the paired values are identical sigma distances
from their respective means; for example, if the person with a sociology
grade of, say, $1.8\sigma$ above the mean were located $1.8\sigma$ above
the mean of the history grades, and all other observations in the two
series were also perfectly paired. Because r is reversible, we could have
applied r to the observed $z_y$'s and thereby estimated the corresponding
$z_x$'s.

Table 10.3.7  Estimated $z_y$ from Observed $z_x$

       $z_x$        $(r)z_x = \hat{z}_y$

      -1.56        (.89)(-1.56) = -1.39
       -.67        (.89)(-.67)  =  -.60
        .22        (.89)(.22)   =   .20
        .89        (.89)(.89)   =   .79
       1.12        (.89)(1.12)  =  1.00
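
The estimates of Table 10.3.7 follow from one multiplication per case. The short Python sketch below is an added illustration; it deliberately starts from the rounded values printed in the tables (r = .89 and the two-decimal standard deviates), which is an assumption made to reproduce the tabled figures rather than a recommendation to round before computing.

```python
# Rounded values as printed in the tables above.
r = 0.89
z_x = [-1.56, -0.67, 0.22, 0.89, 1.12]   # observed income, standard form

# Formula 10.3.5: estimated status in standard form.
z_y_hat = [round(r * z, 2) for z in z_x]

print(z_y_hat)  # [-1.39, -0.6, 0.2, 0.79, 1.0]
```

Carried out on unrounded figures, the first estimate would be -1.40 rather than -1.39; the small difference is a rounding effect only.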

Comparison of r and $r^2$. On the surface, the difference between r and
$r^2$ appears very trivial: a simple detail of exponent, with both values
easily convertible into the other, after one has been computed.
Nevertheless, this seemingly innocent difference cannot be so lightly
dismissed, for the two r's respectively focus on two distinct but
interrelated aspects of covariation, a distinction which the design of the
present chapter has deliberately sought to portray.

Either one could have been mechanically derived from the other by
way of only one of the two procedures. However, an $r^2$ derived from r
would never convey to the student the intent and meaning of explained
variation which $r^2$ measures. The "square of the slope" would mean
nothing to him. Nor would an r, converted arithmetically from $r^2$,
convey the intent and meaning of the slope. The "square root of explained
variation" could not be visualized as slope. Although r and $r^2$ are inter-


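Grouped Data. The computing procedure for grouped data is essentially
the same as for ungrouped, except that we operate on coded class
midpoints instead of the unarrayed, individual values:

$$r = \frac{N\sum fx'y' - \sum fx' \sum fy'}{\sqrt{\left[N\sum fx'^2 - \left(\sum fx'\right)^2\right]\left[N\sum fy'^2 - \left(\sum fy'\right)^2\right]}} \tag{10.3.8}$$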


To illustrate Formula 10.3.8, we apply it to the correlation table of
delinquency rates and average rentals (Table 10.2.2, p. 289), reviewing
at the same time the construction of that table. The steps in the
procedure are as follows:

1. Prepare a grid having at least as many rows and columns as class
intervals in X and Y. Our grid has 10 rows and 10 columns.

2. Write down the X-intervals of uniform width in the upper margin as
column heads; write down the Y-intervals in the left-hand margin as row
heads.

3. Turning to the raw, unordered data, cross-tabulate each item by
placing a tally mark in the appropriate cell. Count the tally marks in
each cell and record the number in the center of the cell.

4. Sum the row frequencies, recording them in the column labeled f
(Table 10.3.9); sum the column frequencies and record them in the
row labeled f. Then sum these row and column marginals for the grand
total (140).

5. In the row labeled x' and the column labeled y', code the class
midpoints for the X- and Y-values. In order to simplify subsequent
arithmetic, the zero origin should be located near the densest
concentration of items. For the present data, then, we would place 0 in
the X-interval 50-59 and in the Y-interval 4.0-6.9. These "intervals of
origin" may be identified by double ruling of column and row. As the
student has previously learned, a plus or minus sign is attached to each
coded midpoint according to whether it is larger or smaller than the zero
origin.

6. Multiply the coded midpoints by their corresponding marginal
frequencies (column f times column y'; row f times row x'); then sum these
weighted values. For our example, $\sum fx' = 1$ and $\sum fy'$ = -W.

7. Calculate the weighted squared coded values by multiplying the
weighted coded values (column fy' and row fx') by the coded values
(column y' and row x'). For our example, $\sum fx'^2 = 489$ and
$\sum fy'^2 = 524$.

8. Multiply each x' by each y' to obtain the coded cross-products, the
x'y''s. For example, multiplying x' = -2 by each y' yields the following
values: 4, 2, 0, -2, -4, -6, -8, -10, -12, -14. These items are
conventionally entered in the upper left-hand corner of each cell.

9. Multiply each x'y' by its corresponding cell frequency and place the
product in the lower right-hand corner of each cell. Sum all such items,
first adding by rows (columns) and thence to the grand total. For our
example, $\sum fx'y' = -291$.

10. Substitute the sums called for by the formula and solve.


that obviate the need to compute slope, the explained deviations, the
unexplained residuals, and other quantities involved in the formulas
given above. Of course, such mechanical computing methods are not
interpretive tools; they are merely recipes that can be followed by any
intelligent clerk or machine.

The computing programs here given begin with the product-moment
formula, which may be rewritten for somewhat simpler calculation:

$$r = \frac{\sum xy}{\sqrt{\sum x^2 \sum y^2}} \tag{10.3.6}$$


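And it may be still more conveniently rewritten, particularly when a
desk calculator is available, in terms of the original, untransformed
values:

$$r = \frac{N\sum XY - \sum X \sum Y}{\sqrt{\left[N\sum X^2 - \left(\sum X\right)^2\right]\left[N\sum Y^2 - \left(\sum Y\right)^2\right]}} \tag{10.3.7}$$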


This is the computing formula for r, ungrouped data. It intends to
eliminate many of the vexatious arithmetic details that are sure to arise
when the operation is carried out on the raw deviations around means.

This formula, be it noted, requires only five sums and correspondingly
five work columns. Two of these columns comprise the original tabulation
of X and Y, so that only three more are needed in order to proceed
with the calculation of r. Substituting sums in the formula, we again
obtain $r = .89$.
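
The five sums and their substitution in Formula 10.3.7 can be sketched as follows. The Python code is an added illustration; the function name `pearson_r_raw` is hypothetical, and the data are again the five families of Table 10.3.1a.

```python
from math import sqrt

# Data from Table 10.3.1a.
X = [3, 7, 11, 14, 15]
Y = [3, 5, 7, 6, 9]


def pearson_r_raw(xs, ys):
    """Pearsonian r from the five raw-score sums (Formula 10.3.7)."""
    n = len(xs)
    sum_x = sum(xs)                               # 50
    sum_y = sum(ys)                               # 30
    sum_x2 = sum(v * v for v in xs)               # 600
    sum_y2 = sum(v * v for v in ys)               # 200
    sum_xy = sum(a * b for a, b in zip(xs, ys))   # 340
    numerator = n * sum_xy - sum_x * sum_y
    denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
    return numerator / denominator


r = pearson_r_raw(X, Y)
print(round(r, 2))  # 0.89
```

No deviations from the means are formed at any point; the subtraction of the means is absorbed into the terms ΣXΣY, (ΣX)², and (ΣY)².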


Table 10.3.8  Worktable for Calculation of r from Ungrouped Values

      X      Y     $X^2$    $Y^2$     $XY$

      3      3       9        9        9
      7      5      49       25       35
     11      7     121       49       77
     14      6     196       36       84
     15      9     225       81      135

  Σ  50     30     600      200      340

$$r = \frac{5(340) - (50)(30)}{\sqrt{[5(600) - (50)^2][5(200) - (30)^2]}} = \frac{1700 - 1500}{\sqrt{(500)(100)}} = \frac{200}{223.6} = .89$$

Table 10.3.9  Correlation Table of Delinquency Rates and Average Monthly Rentals


Unexplained Variation
Regression Line
Line of Least Squares
Product-Moment Correlation Coefficient
Slope

Total Deviation
Explained Deviation
Unexplained Deviation

2. If r = .3, how much of the variation in Y is linearly associated with
variation in X?

3. Evaluate the following r's as high or low: -.75, .69, .25, .96, -.96, .50.

4. How may one explain the unexplained residuals: as the result of chance or
determining factors?

5. Let us suppose that r = .3 between college grades and hours of study.
Analyze this "low correlation." Is this evidence that study does not affect
grades? Discuss.

6. What kinds of marginal distributions should make one cautious about using
the product-moment r?

7. Should r be used if the marginal distributions are highly skewed?

8. "The use of r requires that the scatter of cases around the regression
line be homoscedastic." Elaborate on this statement.

9. Compute r and $r^2$ between total expenditures and clothing expenditures
(Table 10.2.4).

10. Compute r and $r^2$ between per capita income and percentage born
outside the state (Table 10.2.7).

11. List states in which the percentage born outside is greater than 25.
Calculate r and $r^2$ for this subgroup. Compare this r with that obtained
in Question 10 and explain the difference.

12. Compute $b_{xy} = \sum xy / \sum y^2$ for the illustrative problem given
in the text (p. 303). Compare with $b_{yx}$.

13. What would be the effect on $r^2$ of greatly extending the ranges of X
and Y? (Hint: analyze ratio of explained to total variation.)


Section Four
The Correlation Ratio

Curvilinear Regression. Any index of correlation is a measure of the
extent to which one variable may be predicted, or explained, in terms
of the other. Of the many indexes that are employed, Pearsonian r is


Function of the Correlation Table. The correlation table should be taken
for what it is: a mechanical aid to computation. It is stripped down to
essential procedures which help one to get the right answer. It is a
kitchen recipe which, when meticulously followed, will guarantee good
results even in the hands of amateurs and clerks. Such a recipe, however,
does not call for higher criticism of its construction and the selection
of ingredients; and the beginning student should not attempt to
penetrate these equations and routines for the "meaning" of correlation or
for confirmation of its theory, features which are effectively
concealed in this "efficiency table."

The device of the correlation table is especially expedient when the
research design calls for a large number of correlations or
intercorrelations, the computation of which may be delegated to clerical
assistants. For this purpose, inexpensive pre-printed commercial forms are
available which permit the processing of large-scale operations in orderly
manner.

The only precautions which need be observed are those that are
relevant to any other tabulation, such as to obtain the optimum size of
intervals in order not to introduce unnecessary grouping errors, and to
perform suitable checks on accuracy in calculation.

The execution, by pencil and paper methods, of a correlation chart,
even when supplemented by a desk calculator, may seem to some
research workers like an uneconomical observance of a traditional device
which should be replaced by high-speed computers. There are three
rebuttals to this plausible contention: first, not every worker has a
high-speed computer at his beck and call, nor may the project itself be
of sufficient proportions to justify the use of such expensive machines
even when available. Typewriters and printing presses have only
partially displaced the handwritten record. Second, it is the position of
this text that, for the learner, pencil and paper methods, which might
seem somewhat primitive to those who cultivate large-scale designs,
are still the best laboratory method for the growth of understanding.

Third, such devices as the correlation table are visual aids to the worker;
they permit, for example, the examination of scatter to avoid imposing
the linear formula upon a too obviously non-linear distribution. These
visual aids entirely disappear in the dark recesses of a computer, however
efficient they may be to a mature scholar who has the elements of his
subject safely behind him, and who may no longer appreciate the aid
of the lower rungs of the ladder upon which he has climbed.

Questions and Problems

1. Define the following concepts:

Coefficient of Determination
Explained Variation
Coefficient of Non-determination





§ 10.4 The Correlation Ratio


Table 10.4.1  Price Index and Marriage Rate, 1887-1906

          Price     Marriage
Year      Index     Rate
          (X)       (Y)

1887       84        87
1888       87        88
1889       84        91
1890       81        90
1891       82        92
1892       76        91
1893       77        90
1894       69        86
1895       70        89
1896       66        90
1897       67        89
1898       69        88
1899       74        90
1900       80        93
1901       79        96
1902       85        98
1903       85       101
1904       86        99
1905       85       100
1906       88       105

Source: J. M. Reinhardt and G. R. Davies, Principles and Methods of
Sociology, Prentice-Hall, Inc., Englewood Cliffs, N.J., 1933, p. 421
(adapted).


perity, marriage rates will also rise; as prices fall, a symptom of depression,
marriage rates will also fall, but not necessarily at a constant rate.

In order to ascertain whether these data conform to the linear or curvilinear
model, it will first be necessary to prepare the usual scatter diagram
(Figure 10.4.1), which permits us to make such a judgment by inspection.
The scatter diagram, on which a freehand curve has been superimposed,
suggests the degree and type of relatedness between the two variables.
One could have represented the scatter by a straight line, but a curving
line follows the contour of the data more closely. We therefore judge
η² to be a better fit than r² and proceed with the calculation of η²_yx.

The procedure for the computation of η² is quite analogous to that
for Pearsonian r². We must calculate the total variation and the
explained variation and then find the ratio between them. In short,

\[ \eta^2 = \frac{\text{Explained Variation}}{\text{Total Variation}} \]



one of the most conventional, almost commonplace, measures in the field 
of statistics. But the validity of r rests essentially on two conditions: 
(1) the relation between the variables must be linear, and (2) the bivar- 
iate distribution must be homoscedastic. In general, both of these con- 
ditions are provided for when the marginal distributions are normal. 
Therefore, normality of marginals is often set down as one of the re- 
quirements of r. 

But these conditions are difficult of fulfillment, for Nature does not 
necessarily conform to the linear model. Indeed, more often than not 
the raw data come in non-linear patterns. This is especially true in the 
social sciences, where many distributions, such as income or size of family,
are quite skewed, and the regression lines are therefore markedly non- 
linear. 

Since r sets up the straight-line model, it becomes inappropriate to 
the extent that the relation between the variables departs from linearity; 
and also as the scatter diagram becomes heteroscedastic, even though it 
maintains linearity. To be sure, since these ideal conditions can never 
be realized, statistical workers are, by dint of circumstance, necessarily 
tolerant of approximations. But there is a prudent limit beyond which 
approximations should not be permitted to extend, especially if more
fitting measures are available. 

There are various devices with which curvilinear relations may be
measured, only one of which will be treated here. The method set forth
is the correlation ratio,* or η (the Greek lower-case letter eta), but better
expressed as η². It is a relatively simple procedure and adheres to the
same principle on which r² is founded, namely, the ratio of explained to
total variation. It differs from r² procedurally in that the explained
variation is derived from the column (row) means of the correlation
table instead of the hypothetical linear regression line. The correlation
ratio, η², thereby supplements Pearsonian r², which latter enjoys a vogue
and repute that are probably greater than its utility in the realm of
social data justifies.

Calculation of the Correlation Ratio. For simple illustrative purposes, we
shall compute and analyze the correlation ratio between price index and
marriage rate, 1887-1906 (Table 10.4.1), data of manageable volume.†
Here, the sociological hypothesis is this: as prices rise, indicating pros-

* Conventionally, “correlation ratio” is used synonymously with η. In the course of
the discussion, it will become clear that η has little meaning for all practical purposes;
it is η² that is the meaningful measure of correlation. Hence, the term “correlation
ratio” will be used interchangeably with η².

† Since these data are in the form of a time series, we would normally test them for lag
between the two variables in order to determine whether the data should be paired as
given or with a lag of a year or more, on the assumption that marriage rates would
not react to prices until after a lag. This analysis will be dispensed with here.





\[ \bar{Y} = Y' + \left(\frac{\sum f y'}{N}\right) i = 92 + \left(\frac{4}{20} \times 5\right) = 93 \]

5. Calculate the sum of squares in Y, or total variation, by the
computing formula (see p. 166):

\[ \sum y^2 = i^2\left[\sum f y'^2 - \frac{(\sum f y')^2}{N}\right] = 5^2\left[26 - \frac{4^2}{20}\right] = 630 \]

6. Compute the mean of the Y-values within each column, Ȳ_c, and
insert the result in the row carrying that designation. Thus, the mean
of the first column would be:





Figure 10.4.1 Scatter Diagram, Wholesale Prices and Marriage Rates,
1887-1906


However, the details of the procedure vary. This is attributable to the
fact that the “line of regression,” which supplies us with the explained
deviations, is now a line of column means instead of an unbroken straight
line. Accordingly, we must determine the mean of the values in each
column (row) in the correlation table, the grand mean of these values,
and the deviations between the respective column means and the over-all
mean. The entire procedure is best described in terms of the following
steps:

1. Employ the familiar procedure of grouping bivariate data into
intervals of proper width (5 in this instance) with Y-intervals in
rows, X-intervals in columns. The result is shown in Table 10.4.2.

2. Determine the marginal frequencies (f) and check the grand total.

3. Set up code columns: y', fy', fy'², as in the previous correlation
chart.

4. Calculate the grand mean of the Y-values by Formula 5.3.3, in
which Y' = the midpoint of the class interval which has been coded zero,
and i = class interval,


Table 10.4.3 Worksheet for Calculation of η²_xy



variation within rows and columns will not be exactly equal; hence,
η²_yx will generally not be equal to η²_xy. In the extreme (but very
unlikely) case, there could be absolutely no variation within columns but
a substantial amount within rows, so that η²_yx would be unity but η²_xy
only moderately high.
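A small hypothetical example may make this extreme case concrete. The sketch below is illustrative only: the five (X, Y) pairs are invented (they satisfy Y = (X - 1)²), every distinct value is treated as its own column or row, and the function name `eta_squared` is an assumption of this sketch, not a formula from the text.

```python
from collections import defaultdict

def eta_squared(groups, values):
    """Correlation ratio of `values` on the categories in `groups`:
    explained variation (from group means) divided by total variation."""
    by_group = defaultdict(list)
    for g, v in zip(groups, values):
        by_group[g].append(v)
    grand_mean = sum(values) / len(values)
    total = sum((v - grand_mean) ** 2 for v in values)
    explained = sum(len(vs) * (sum(vs) / len(vs) - grand_mean) ** 2
                    for vs in by_group.values())
    return explained / total

# Hypothetical data: Y = (X - 1)^2, so each X-column holds a single Y-value
x = [0, 1, 2, 3, 4]
y = [1, 0, 1, 4, 9]

eta2_yx = eta_squared(x, y)  # no variation within columns
eta2_xy = eta_squared(y, x)  # the row Y = 1 contains both X = 0 and X = 2
print(eta2_yx, eta2_xy)
```

Here the columns leave nothing unexplained, while one row still contains two different X-values, so the two ratios cannot coincide.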

Conditions Under Which η² May Be Employed. The conditions under
which η² is applicable are more liberal than are those of Pearsonian r².
No restrictions are placed on the pattern of marginal distributions, which



Midpoint
  (Y)        f       fY

   92        1       92
   87        3      261
            ---     ---
             4      353

Ȳ_c = 353/4 = 88.25

7. Find the deviation of each column mean from the grand mean:
Ȳ_c - Ȳ. In the first column:

88.25 - 93.00 = -4.75 (Row “d”)

8. Weight each deviation (d) by its corresponding column frequency
and insert in proper row (fd). In the first column:

-4.75 × 4 = -19.00

9. Square each explained deviation. Operationally, this merely involves
multiplying d by fd, on adjacent rows. Again in the first column:

-19.00 × -4.75 = 90.25

10. Find the sum of the weighted squared deviations:

Σfd² = 328.88

This is explained variation; it is the variation in Y that would have been
observed if each Y-value were located exactly at the mean of its own
column, as if there were no variation within columns.

11. Divide the explained variation by the total variation, which is the
correlation ratio as defined above:

\[ \eta^2_{yx} = \frac{\text{Explained Variation}}{\text{Total Variation}} = \frac{\sum f d^2}{\sum y^2} \qquad (10.4.1) \]

\[ = \frac{328.88}{630.00} = .52 \]

Thus, we find that 52 per cent of the variation in marriage rates is 
explained by variation in price indexes. 

In exactly analogous fashion, but operating on row means instead of
column means, we may determine η²_xy, the proportion of variation in
X explained by Y. Upon completing this operation, which is compactly
summarized in Table 10.4.3, we find that η²_xy = .40.
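The eleven steps, and their row-wise counterpart, can be sketched in Python. This is a minimal illustration under stated assumptions rather than the text's own worksheet: the names `midpoint` and `eta_squared_grouped` are invented here, the class intervals are assumed to begin at 65 for X and 85 for Y with a width of 5, and the 1898 marriage rate is read as 88.

```python
from collections import defaultdict

# Table 10.4.1: (price index X, marriage rate Y), 1887-1906.
# Assumption: the 1898 marriage rate is read as 88.
DATA = [(84, 87), (87, 88), (84, 91), (81, 90), (82, 92), (76, 91), (77, 90),
        (69, 86), (70, 89), (66, 90), (67, 89), (69, 88), (74, 90), (80, 93),
        (79, 96), (85, 98), (85, 101), (86, 99), (85, 100), (88, 105)]

def midpoint(value, start, width=5):
    """Midpoint of the class interval holding value (integer class limits assumed)."""
    lower = start + ((value - start) // width) * width
    return lower + (width - 1) / 2

def eta_squared_grouped(pairs, first_start, second_start, width=5):
    """Return (explained, total, ratio) for the second variable of each pair,
    grouped by the class intervals of the first variable."""
    columns = defaultdict(list)
    for a, b in pairs:                                        # step 1
        columns[midpoint(a, first_start, width)].append(
            midpoint(b, second_start, width))
    values = [v for vs in columns.values() for v in vs]
    grand_mean = sum(values) / len(values)                    # step 4
    total = sum((v - grand_mean) ** 2 for v in values)        # step 5
    explained = 0.0
    for vs in columns.values():
        d = sum(vs) / len(vs) - grand_mean                    # steps 6-7
        explained += len(vs) * d * d                          # steps 8-10
    return explained, total, explained / total                # step 11

# Y explained by X (column means), then X explained by Y (row means)
explained_yx, total_y, eta2_yx = eta_squared_grouped(DATA, 65, 85)
swapped = [(y, x) for x, y in DATA]
explained_xy, total_x, eta2_xy = eta_squared_grouped(swapped, 85, 65)
print(round(eta2_yx, 2), round(eta2_xy, 2))
```

Carried at full precision, the explained variation comes to about 328.75 rather than 328.88; the small difference presumably reflects rounding of the column deviations in the hand calculation, and both figures round to a ratio of .52.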

The fact that these two η²-values are different illustrates the significant
principle that, unlike r², η² is not reversible. In general, the relative


Figure 10.4.2 Curvilinear Scatter



the curve, since the straight line becomes less and less representative
as the data become more and more curvilinear. In fact, on the edges of
the data, the straight line departs increasingly from the swarm as one
projects the prediction beyond the observed values.

Furthermore, the norming potential of r² is reduced when applying
the linear model to curvilinear reality because the index cannot range
from zero to unity. It will be recalled that φ² was less than unity whenever
the marginal distributions were not identical. Analogously in this
case, an r² of unity would be unattainable when mistakenly applied to
non-linear data. But η², as already shown, is not so restricted, and could
theoretically attain unity. In the case of φ² it was suggested that under
certain circumstances we might resort to Q; similarly, when r² is
inadequate, we turn to η².


Precaution in the Application of η². In order to implement the formula
of η², it is necessary to group the data into segments, and such grouping
always involves a certain degree of arbitrariness. By pushing this process
to an absurd extreme, we could conceive of as many class groupings
as there are points, with each observed point representing its own column.
The regression curve would then pass through every point. Under such
an arrangement, η² would be perfect unity since no deviation within the
columns would be possible. On the other hand, groupings could be made
absurdly large, and thereby fail to etch the pattern of distribution. In
such a case, the residuals would be too large to do justice to the curvature
of the pattern, and once again the familiar phenomenon of large
grouping errors would appear.
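The two absurd extremes can be checked numerically. The sketch below is an illustration only: the twelve observations are hypothetical, and the function name `eta_squared_by_width` and the simple rule for forming columns are assumptions of the sketch.

```python
from collections import defaultdict

def eta_squared_by_width(x, y, width):
    """Correlation ratio of y on x after grouping x into columns of a given width."""
    columns = defaultdict(list)
    lowest = min(x)
    for xi, yi in zip(x, y):
        columns[(xi - lowest) // width].append(yi)
    grand_mean = sum(y) / len(y)
    total = sum((yi - grand_mean) ** 2 for yi in y)
    explained = sum(len(ys) * (sum(ys) / len(ys) - grand_mean) ** 2
                    for ys in columns.values())
    return explained / total

# Hypothetical observations, invented purely for illustration
x = list(range(12))
y = [3, 5, 4, 8, 9, 7, 12, 10, 11, 9, 6, 7]

one_point_per_column = eta_squared_by_width(x, y, width=1)  # as many columns as points
moderate_grouping = eta_squared_by_width(x, y, width=3)     # four columns of three points
single_column = eta_squared_by_width(x, y, width=12)        # grouping absurdly large
print(one_point_per_column, moderate_grouping, single_column)
```

With one point per column the ratio is forced to unity, and with a single column it collapses to zero, whatever the data; only the intermediate grouping says anything about the pattern of the swarm.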

The general rule is to construct the columns of such width that the
means of the columns are fairly stable, which implies that the means 


may be normal, skewed, or even bimodal; the regression line may follow
any curvilinear pattern. Even a qualitative variable may be used on
one axis, although not on both. (If both were qualitative, there could
be no arithmetic means, and we would have to return to the contingency
coefficient, C.) Only one limiting factor may be suggested: the plotted
observations should, in general, be homoscedastic at least in one direction;
and this for the same reason that Pearsonian r sets up the same
requirement, namely, a synoptic measure is usually more meaningful
when made up of homogeneous items. But this is a logical reason, not
a mathematical restriction at all. It is the same reason that sometimes
renders a mean of widely dispersed data inadvisable.

Principles of Interpretation. Since the hypothetical regression line shifts
its direction as a result of its curvatures and bends, one cannot interpret
η as one would r: “the higher the X, the higher the Y” (technically,
the average change in Y per unit of X). Such an interpretation would
be meaningless, because the regression line may change direction; hence,
η is not a counterpart to the constant slope, r. Indeed, η has no definable
function at all in the measurement of correlation; it is merely the square
root of η², which is the proper measure of curvilinear correlation.

Since each swarm of plotted points will always have a certain amount
of curvilinearity in it, η², which reflects curvilinearity and linearity equally
well, will therefore always be a more precise and complete statement of
explained variation, and therefore will always be higher than r². It will
always maximize our statistical explanation because it adapts itself
automatically to the configuration of the data. On the other hand, r² is more
rigid, restricting its potency to reflecting the linear model. When perfect
linearity prevails, η² = r².
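The relation between the two measures can be illustrated with a short sketch. Both data sets below are hypothetical and were chosen only to make the point; the function names are assumptions of the sketch. In the first set the column means fall on a straight line; in the second they fall and then rise again.

```python
from collections import defaultdict

def r_squared(x, y):
    """Square of Pearsonian r, computed from deviations about the means."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)

def eta_squared(x, y):
    """Correlation ratio of y on x, each distinct x-value forming one column."""
    columns = defaultdict(list)
    for a, b in zip(x, y):
        columns[a].append(b)
    grand_mean = sum(y) / len(y)
    total = sum((b - grand_mean) ** 2 for b in y)
    explained = sum(len(ys) * (sum(ys) / len(ys) - grand_mean) ** 2
                    for ys in columns.values())
    return explained / total

x = [1, 1, 2, 2, 3, 3]
y_linear = [1, 3, 3, 5, 5, 7]   # hypothetical: column means 2, 4, 6 lie on a line
y_curved = [5, 7, 1, 3, 5, 7]   # hypothetical: column means 6, 2, 6 bend back

r2_linear, eta2_linear = r_squared(x, y_linear), eta_squared(x, y_linear)
r2_curved, eta2_curved = r_squared(x, y_curved), eta_squared(x, y_curved)
print(r2_linear, eta2_linear, r2_curved, eta2_curved)
```

When the column means are strictly linear the two indexes agree even though scatter remains within the columns; when the means bend back on themselves, r² can fall to zero while η² stays substantial.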

The reason η² is able to reflect the explained variation more fully is
simply that it in effect divides the curved regression line into segments
which will always be in closer proximity to the points in the swarm than
would be a single straight line. Hence, the residuals will be smaller.
In a strictly linear swarm, with only one direction, this flexibility is, of
course, not called for.

In Figure 10.4.2, it is obvious that the straight line does not do justice
to the swarm as does the curved regression line. However, there
is nothing in the mathematical processes which will make it impossible
to use the linear formula on curvilinear data. It will “pick up” whatever
linearity there is, which may not be very much. A curved line
is simply a better fit and therefore more adequate for our purposes.

This better fit becomes all the more evident when we use the regression
line for its intended purpose, namely, the prediction of one variable
from the other. It is quite apparent that a Y-value read from the straight
line will be subject to much larger error than it would be when read from



§ 10.5 Interpretation of Correlation


it. Specifically, our questions are: (1) what is the essential meaning of
correlation? (2) how competent are the statistical tools to measure corre- 
lation? and (3) how can one best interpret a correlational measure after 
it has been obtained? 

The Meaning of Correlation. It would be naive to take for granted that 
any obtained index of association between two variables is necessarily a 
measure of the true relationship between them. Nature does not reveal 
herself so readily in response to the application of a mere man-made con- 
trivance such as a formula or equation. These natural relationships are, 
after all, quite complex: they may be strong or weak, intimate or remote, 
simple or complex, genuine or spurious. 

When statistical linkages are analyzed, we find that they may be any
one of several general types. These will be briefly reviewed.

(1) Chance Relation. By definition, a “chance relation” is, of course, 
a contradiction in terms. There may be a chance joint occurrence, but 
no “chance association.” For, if there is any relation at all, the linkage or 
association is represented by a frequency which exceeds chance. Never- 
theless, the expression “chance relation” is often informally and con- 
veniently used to identify the frequencies which exemplify statistical 
independence between variables. Strictly speaking, therefore, chance 
relation is the negation of relation, statistically expressed as zero correla- 
tion; it thereby sets the lower boundary to our descriptive vocabulary 
for the complete range of relational measures. 

(2) Nonsense Correlation. An obtained measure of relation may be said 
to be a sheer coincidence when it does not represent a “true” functional 
relation. We may select any set of observations, correlate it with any other 
set, and fortuitously obtain a “correlation.” 

To illustrate: it may be shown that as the per capita consumption of
liquor has increased in the United States the average length of life has also
increased. Does this indicate a positive relation between the intake of
liquor and length of life? Does this contradict the stereotyped opinion
that liquor shortens life? Statistically, the correlation is impeccable; but
abstract statistics are indifferent to social content. When the social
content is analyzed, not only do the two series of data seem quite remote,
but they do not even necessarily refer to the same members in the population.
The individuals who “drink” may not be the individuals who live
long. It is quite true, people in general are living longer than ever and
liquor is also being consumed in ever greater quantities. But these two
sets of data may be said only to coexist, not to interact. Yule dubbed
such mere arithmetic “correlations” nonsense correlations. They are
frequently cited by those who wish to disparage statistical method by
showing what absurd propositions can be “proved” by statistics.

(3) Correlation as Evidence of Cause and Effect. In layman’s language,



would not vary appreciably were the boundaries of the intervals narrowed
or extended. Also, if the plotted points thin out on the edges of
the data, it is good policy to eliminate these border data since they tend
to increase disproportionately the over-all correlation.

Questions and Problems 

1. Does the calculation of η² assume a smooth regression line?

2. In what respect does the correlation chart differ from the simple frequency
table?

3. Would it be possible to calculate η² between two qualitative variables? Why
or why not?

4. Is the numerical value of η² affected by the order of columns (rows)?

5. Under what tabular conditions would r² and η² be equal in numerical value?

6. Show graphically the conditions under which r would be approximately zero
and η² 1.00.

7. Verify graphically that η² may be brought close to unity by employing as
many columns as there are values.

8. When the regression is curvilinear, why is homoscedasticity not likely to be
uniform in both directions? Show graphically.

9. Calculate η²_yx and η²_xy between delinquency rates and average monthly rentals
(Table 10.2.2).

10. Calculate r² between price index and marriage rate and compare with the
obtained η²-value of .52 (Table 10.4.1).


Section Five 

Some General Guides to the
Interpretation of Correlation

The skillful computation of an abstract index of correlation does not
exhaust the responsibility of the sociologist, nor is the calculated result
(which is often more or less routinely obtained) a guarantee that its
sociological significance has been grasped. It is quite possible to be
thoroughly familiar with the purely statistical character of a correlational
index and still be completely unaware of its sociological import.

All statistical correlation rests on the assumption of some kind of 
cohesion in nature, whether observed directly or under experimentally 
controlled conditions. We are very much concerned, therefore, about just 
what it is that correlation is supposed to measure, and how well it measures

mistaken, the inference may be revised. Such a procedure does not differ 
in principle from that practiced by an auto mechanic in locating the 
“cause” of the rattle in an old car, or the current procedure in determining
the cause, or causes, of colds, cancer, or of war. 

(4) Correlation as a Measure of Common Factors. The high correlation
which we could expect between economics and sociology grades could not
be primarily explained by a causal relation between the two, but rather
by the common factors which make for academic achievement in these
two related subjects. The grades achieved in economics and sociology
are merely two slightly different manifestations of almost exactly the same
phenomenon, the components of which are: degree of intelligence, habits
of study, grading systems, similarity of subject matter, common motivation
and interest. It would be tautological to state that the two sets of
data, being made up of the same content, would vary together. To the
extent, however, that other factors do intrude (for example, variation in
teaching staffs, subject matter, and motivation), the correlation would
be correspondingly reduced. High delinquency and high truancy may
similarly be associated, not because one causes the other, but because
they too, like sociology and economics marks, are characterized by common
factors: poverty, unsupervised play, absence of mother from home,
and the like. In the field of heredity, the characteristics of siblings, which
show many similarities, are likewise the resultant of common antecedent
hereditary factors. In fact, this was the original conception of correlation
as propounded by Francis Galton, who first gave currency to the concept
through his studies in heredity during the last quarter of the nineteenth
century.

The various ways in which two measured variables might be bound
together by underlying common factors may be schematically represented
by means of two intersecting circles. The presence of hypothetical common
factors is indicated by the overlapping areas of the figures. The first
diagram would represent the case of correlation between economics and
sociology grades which are compounded to a considerable extent of identical
ingredients. The second illustration pictures the very high correlation
coefficient between test-retest scores on a social attitude inventory,
administered to the same population before and after an intervening
period. Diagram 3, in which there are no common elements, portrays a
near zero correlation, an example of which might be the relation of hair
color and susceptibility to disease. Actually, it is quite difficult to find a
convincing illustration of genuine zero correlation; for, if underlying factors
are pursued extensively enough, some remote interconnecting common
factor will doubtless be dredged up. The last figure, in which one variable
is totally included within the other, is again an instance of a hypothetical
possibility, but probably never in complete correspondence with fact.

A generally humane attitude on public affairs would dictate a person's 


as well as that of professionals, there is no concept so commonly employed,
and so useful in the understanding of the world about us, as the concept
of “causation.” We speak of the causes of death, the causes of accidents,
the causes of divorce, of delinquency and crime. It is thought that, if the
causes of delinquency, disease, and war could only be uncovered, we could
control their appearance. Statistical researches in biology are built
around correlations that are designed to reveal, for example, the effect of
nutrition on plant and animal growth.

The causal influence may flow unilaterally in one direction; or bilaterally
in both directions, simultaneously. Thus, the hours of study may “cause”
good grades, and good grades may also encourage more study; higher
wages may cause higher prices, and higher prices bring higher wages in
their famous spiral effect. Childlessness may influence divorce, but the
prospects of divorce may cause childlessness; marriage may increase
length of life, and healthy, long-lived people tend to marry.

In spite of the wide prevalence of the concept of “causation” and its
quite obvious utility, there is in some intellectual circles a fashionable
objection to it. The notion that one variable may exercise a force upon
another seems to some thinkers too mystical to be credible. According
to one school of thought, all we can know is empirical association and
sequence. As the argument of Hume runs, we cannot perceive causation,
but only statistical contingency; the only statement we may make is one
of measurement of invariant relations between massed observations.

According to this view, therefore, a statistical measure of correlation
does not necessarily indicate causation. We may state only that correlations
reveal a mathematical relation, in the sense that values vary with
one another. Consequently, a rugged empiricist will leave the question
open as to whether there is such a thing as causation in the first place, and
if so, whether its direction can be ascertained by statistical means.

Although these sophisticated questions are of undoubted fascination,
common sense will not renounce the concept of cause. No amount of
intellectual equivocation will raise doubts in the mind of a gardener that
for all practical purposes rain “causes” the plants to grow and not the other
way around. An intellectual who may reject the concept of cause will
nevertheless in real life act as though it existed. The applied scientist,
who is called on to manipulate and control, will use the conception of
causative factors as an indispensable guide to action. Statistical correlations
are undoubtedly an aid in discovering, if only by inference, where
causation operates, and how strong it is.

The practicing statistician is therefore not primarily concerned with the
metaphysics of causation, even though the subject may delight him in
his leisurely intellectual moments. If, after various statistical computations,
one or more variables seem to produce another, we label those factors
as “cause.” If, after further observation and experimentation, we are


provided for in the formula. Things would be simple if that were so. The
difficulty is that, in addition, this measure will reflect an unknown number
of variables which are not represented in the formula. This is due to the
well-known circumstance that there are a large number of factors involved
in each event that are constantly interacting and operating through one
another but for which there is no room in the formula. We must realize
that a formula may accommodate relatively few variables, according to
its design and make-up. But by restricting ourselves to the observation of
only a few variables, we do not immobilize nor exclude from operation,
within the field before us, the numerous other factors which continue to
produce their effects on our measures whether or not we happen to be
looking at them. For example, when we measure the height or weight
of a person, we are also measuring the effects of his age, sex, race, lifelong
nutrition, way of life, and even the time of day (since height normally
shrinks during the day, and weight increases). These factors which intrude
in our measurements are called concealed factors, since the surface information
carries the effect of such hidden factors without identifying them.
In a correlation between race and death rates in the United States, the
measure would indicate a high correlation, since Negroes have a high death
rate and whites a low rate. However, in measuring Negro death rates, we
are also unwittingly measuring the death rates of the lower socio-economic
levels to which the Negroes predominantly belong. A classification of the
United States population by race is, to a large extent, at the same time a
classification by socio-economic background. By corollary, we are measuring
the death rates of higher socio-economic levels when nominally measuring
rates of the white race. Hence, we cannot take the correlation at its
face value. The variable race is spuriously credited with the cause of
death rates, whereas the concealed factor of socio-economic status is the
“true” factor with which death rates are associated.

To be sure, every correlation contains some effects of concealed factors
that are doing the work for which the spurious factor, named in the correlation,
is superficially credited. A spurious correlation may be defined, therefore,
as a correlation in which hidden factors are exerting the influence for
which the surface factor is erroneously credited. Clearly, it is the interpretation
that is spurious, rather than the statistical correlation as such.
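The working of a concealed factor can be shown with a deliberately artificial sketch. All eight records below are hypothetical and were constructed so that the outcome depends only on the concealed factor; the names `surface`, `concealed`, `outcome`, and `pearson_r` are assumptions of the sketch, not data or notation from the text.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearsonian r computed from deviations about the means."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical records: (surface factor, concealed factor, outcome).
# The outcome runs higher where the concealed factor is 1, and the surface
# factor merely happens to be more common in that stratum.
records = [(0, 0, 5), (0, 0, 7), (0, 0, 6), (1, 0, 6),
           (1, 1, 9), (1, 1, 11), (1, 1, 10), (0, 1, 10)]

surface = [s for s, _, _ in records]
outcome = [o for _, _, o in records]
r_overall = pearson_r(surface, outcome)

# Hold the concealed factor constant and correlate again within each stratum
r_within = {}
for level in (0, 1):
    stratum = [(s, o) for s, c, o in records if c == level]
    r_within[level] = pearson_r([s for s, _ in stratum], [o for _, o in stratum])

print(round(r_overall, 3), r_within)
```

The surface factor shows a moderate correlation with the outcome over all records, yet none at all once the concealed factor is held constant, which is the sense in which the interpretation, not the arithmetic, is spurious.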

Spurious correlation is often employed as a derogatory epithet, as
though a correlation, computed from a small fragment of our complicated
web of life, could ever be completely cleansed of concealed factors.
Sometimes, however, the degree of spuriousness is absurdly obvious, bordering
on nonsense. Karl Pearson first proposed the concept spurious in 1897,
and Yule, as previously noted, termed correlations nonsense correlations
when the remoteness or absurdity of the association between the identified
variables was apparent to any thinking person.

Since there is a degree of spuriousness in every obtained correlation,




[Figure 10.5.1 Correlation as a Measure of Common Factors. Panel labels:
1. X and Y have a great deal in common; 2. X and Y have practically
everything in common; ... to X, but not the reverse.]


attitude on a specific issue, in which case the part would be included
within the whole, without remainder. However, human attitudes are not
quite as consistent as all that, for everyone shows some symptoms, however
slight, of a schizoid character.

A type of correlation which at first glance may appear to be embodied
in the foregoing classification is sometimes labeled “part-whole,” but
should more realistically be described as “self-correlation.” It diverges
from the preceding system in that the “common” elements derive from
the fact that the observations are, in part, duplicated in the two series of
data. In a comparison of freshmen and all-university grades, the freshmen
grades appear as duplicates within the total university observations. To
the extent that freshmen are a smaller or larger component of the respective
student bodies, the resulting correlation will automatically be lower or
higher, for the very evident reason that correlation between identities is
bound to be perfect. The result will be a truism rather than a revelation.
It is even conceivable that the “part” will show little correlation with the
remaining components and yet, because of the size of the common part,
yield a considerable, but redundant, correlation. It raises a question
whether such a computation should be undertaken at all, and whether it
would not be more prudent to correlate the individual components among
one another.
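The redundancy of self-correlation can be seen in a minimal sketch. The two four-item series below are hypothetical and were chosen to be exactly uncorrelated with each other; the names `part`, `rest`, `whole`, and `pearson_r` are assumptions of the sketch.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearsonian r computed from deviations about the means."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical scores for four units: one component (the "part") and the
# remaining component, constructed to be uncorrelated with each other
part = [1, -1, 1, -1]
rest = [1, 1, -1, -1]
whole = [p + q for p, q in zip(part, rest)]   # the "whole" duplicates the part

r_part_rest = pearson_r(part, rest)     # the components themselves are unrelated
r_part_whole = pearson_r(part, whole)   # yet the part correlates with the whole
print(r_part_rest, round(r_part_whole, 3))
```

The second coefficient arises entirely from the duplication of the part within the whole, and it would rise further if the part were made a larger share of the total.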

(5) Spurious Correlation. An index of correlation presumably measures
the degree of relationship between the two (or more) variables which are

constitute nature and society, and it ignores the useful services which
these concepts have already administered. Far from being supernumerary,
the variety of correlational measures might with better logic be construed
as evidences of adaptability to the intricacies of nature.

Basically, the suitability of a specific correlational formula is judged
according to the following three general criteria: (1) the pattern of
relationship between the specified variables; (2) the nature of the data, or the
form in which they are expressed; and (3) the desired degree of accuracy
which is differentially yielded by the equally available measures. Each
of these criteria will be discussed in its essentials only, for it is not necessary
at this stage to reiterate the detail now familiar to the student.

(1) The Pattern of Relationship. The multiplicity of indexes derives,
to a very large extent, from the fact that “relationship” is not a monistic
concept; hence, no single index could ever presume to be omnicompetent
in the presence of the various types of configuration of the interrelationship.
Thus a linear relation is distinguishable from curvilinear
interdependence: a linear relation is defined by a uniform rate of change in
one variable for every unit of change in the other; while a differentially
changing rate characterizes the curvilinear relation. A relation between
only two variables is not identical with that between three or more
variables, since, as the number of variables increases, the network which
connects them becomes more intricate. Relations may also have a
directional dimension, conceived as one- and two-way dependence. A detailed
familiarity with the respective formulas and their derivation will reveal
the subtler differences between the various connotations of “relationship”
which the symbols are designed to express.
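
The distinction between linear and curvilinear relation may be made concrete with a few invented figures. In the sketch below, which assumes the usual definition of the correlation ratio (the square root of the between-group sum of squares over the total sum of squares), y depends strongly on x along a U-shaped curve; the data and the function names are hypothetical and serve only as illustration.

```python
from collections import defaultdict
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation (measures linear relation only)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def correlation_ratio(x, y):
    """Eta of y on x: sqrt(between-group sum of squares / total sum of squares)."""
    groups = defaultdict(list)
    for a, b in zip(x, y):
        groups[a].append(b)
    grand_mean = sum(y) / len(y)
    ss_total = sum((b - grand_mean) ** 2 for b in y)
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups.values()
    )
    return sqrt(ss_between / ss_total)

# Invented data: two observations at each x, scattered one unit around x squared.
x = [v for v in range(-3, 4) for _ in range(2)]
y = [v * v + d for v in range(-3, 4) for d in (-1, 1)]

r = pearson_r(x, y)
eta = correlation_ratio(x, y)
print(f"Pearsonian r = {r:.3f}")
print(f"eta          = {eta:.3f}")
```

For these symmetric data the linear index vanishes while the correlation ratio remains high, which is the sense in which no single index can be omnicompetent.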

(2) The Forms of the Data. Data do not always come in identical forms 
and consequently are not measurable in the same units. Attributes are 
enumerated in frequencies, and variates are expressed in magnitudes. 
Since a formula must have the capacity to digest the data which are fed 
into it, we will require in general as many types of formulas as there are 
different forms of data. 

For example, attributes are presented in r x c contingency tables, and
their association is measurable by the degree of discrepancy between
chance and observed joint frequencies. Quantitative variables, on the
other hand, are presented as magnitudes whose correlation is measured
by the concomitant variation symbolized, for example, as r and η.

(3) Degree of Precision Required. At his present stage of development,
the student’s statistical mentality should recognize that nature cannot be
clearly duplicated on paper in the shape of formulas and symbols.
Therefore, any formula is but an ideal-typical construct which only approximates,
more or less, the raw, fragmentary data torn out of context. Thus,
Pearsonian r assumes that the bivariate data are distributed normally; φ works
best when the marginal sets are identical. But nature in the raw never


the concept loses its characteristic as an epithet. It is a foregone
conclusion that social behavior is complex, and that no correlation is actually
what it seems. Depending upon one’s detailed interest, every correlation
should be analyzed and broken up, and could be pursued indefinitely to
greater refinement. No one, for example, would reject or undervalue the
association between moisture and plant growth because of the many
intervening variables which actually “do the work” being measured in the
correlation: between moisture and its ultimate effect are such factors as the
solution of chemicals in the soil, temperature, proximity of other plants
which compete for the moisture, and sunlight.

Is there, then, any routine test by which we can tell whether a correlation 
is spurious or genuine? Actually, as has already been implied, no correla- 
tion is wholly genuine in the sense that hidden factors are completely silent. 
But statistical devices are available which will “eliminate” one or more
of the spurious factors and lay bare the “genuine” factors, thus measuring
their separate effects. Simply put, this involves introducing additional 
variables into the system, and testing them for correlation. In general, if 
the additional variable incorporated into the computation changes the 
original index, or leaves it unaffected, we may state, respectively, that the 
original correlation was spurious or genuine. 

More specifically, there are three relatively simple, interrelated methods
by which additional variables may be incorporated into the measure of
relation, two of which have been previously discussed. (1) The method
of subclassification has been presented as a tool for testing the
genuineness of an observed relation (Chapter 7, Section 2). (2) Standardization
serves additionally to summarize the specific classes of observations
into a single measure, by which we may test the validity of the relation
between variables. (3) Partial correlations are designed to measure the
correlation between two specified variables while one or more others are
statistically held constant. The technique of partial correlation, which
is not presented in this text, is merely an extension of the Pearsonian r.
In general, partial correlations require more elaborate computations, but
invoke no new principles or concepts.
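
Although partial correlation is not developed in this text, a brief sketch may show what “holding a variable constant” amounts to. The formula used below is the standard first-order partial correlation found in general statistical references, not a formula taken from these chapters, and the data are simulated: a hypothetical hidden factor `z` influences both `x` and `y`, which otherwise have nothing to do with one another.

```python
import random
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

rng = random.Random(1)  # fixed seed; all figures are invented
n = 5000
z = [rng.gauss(0, 1) for _ in range(n)]   # hidden common factor
x = [c + rng.gauss(0, 1) for c in z]      # x depends on z plus independent noise
y = [c + rng.gauss(0, 1) for c in z]      # y depends on z plus independent noise

r_xy = pearson_r(x, y)
r_xy_given_z = partial_r(r_xy, pearson_r(x, z), pearson_r(y, z))
print(f"r(x, y)            = {r_xy:.2f}")
print(f"r(x, y | z fixed)  = {r_xy_given_z:.2f}")
```

In this constructed case the original index should drop to nearly zero once the third variable is introduced, which by the rule stated above would mark the original correlation as spurious.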

Why So Many Types of Correlation Measures Are Required. When
confronted with the long list of correlation measures (Q, φ, ρ, C, r, η, as well
as numerous others not represented in these chapters) a student might
well become skeptical of the validity of a concept which yields so many
different answers to the same question: “How strong is the affinity
between two (or more) sets of data?” Do they all possess the same degree
of competence to answer this question? Does not the multiplicity of
answers to the same question tend to discredit the validity of them all?

Such healthy though immature statistical agnosticism does not evince
adequate appreciation of the extremely intricate web of relations which

there are certain typical pitfalls into which students may stumble, and
some of these are here itemized as aids in interpretation. These hints
may be analyzed and illustrated by the student, and will go far toward
enriching his understanding of correlation in general, and of social science
as well.

(1) Time Lag. Two series of data, such as prices and wages, may be
very definitely linked, each influencing the other. However, the influence
of one on the other may occur only after a lag of a period of time.
Consequently, to measure the inherent relation between the two time series, it
would be incorrect to pair the data within the same year; rather, we should
pair them diagonally, at intervals of a year or two, depending on the
estimated lag. After a little experimental pairing, the most plausible time lag
will be revealed by the highest attainable correlation. Such a syncopated
pairing would not, of course, be justified when the time factor is not
functionally involved.
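
The experimental pairing described above might be sketched as follows. The two series are simulated, with a lag of two periods built in by assumption; the names `prices`, `wages`, and `lagged_r` are hypothetical, and the search simply takes the largest positive r among the lags tried.

```python
import random
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def lagged_r(leader, follower, lag):
    """Pair leader[t] with follower[t + lag] and correlate the pairs."""
    return pearson_r(leader[: len(leader) - lag], follower[lag:])

rng = random.Random(2)  # fixed seed; invented series, not real price or wage data
n, true_lag = 200, 2
prices = [rng.gauss(0, 1) for _ in range(n)]
wages = [
    prices[t - true_lag] + rng.gauss(0, 0.3) if t >= true_lag else rng.gauss(0, 1)
    for t in range(n)
]

results = {lag: lagged_r(prices, wages, lag) for lag in range(5)}
best_lag = max(results, key=results.get)
for lag, r in results.items():
    print(f"lag {lag}: r = {r:.2f}")
print("most plausible lag:", best_lag)
```

Pairing within the same period (lag 0) should show almost no relation here, even though the series are tightly linked once the built-in lag is allowed for.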

(2) Measures Applied to Heterogeneous Data. The non-uniform scatter
(e.g., gourd-shaped) around the line of regression is evidence of
heterogeneity of the data, a distribution that is called heteroscedastic. Since
a wide scatter indicates low correlation, and a narrow scatter high
correlation, the resulting r does not do justice to either extreme. Such an r
would be analogous to combining in one mean the salaries of janitors and
managers, with a consequent equal representation of both groups. Under
most circumstances it would be far better to segment the data into more
homogeneous groupings to obtain a measurement which would do justice
to the high (or low) correlation in the respective subgroups. This is not
an illegitimate procedure for the purpose of pushing up the correlation
measure. Quite to the contrary, it is an intelligent procedure in the analysis
of data to arrive at more realistic and serviceable indexes.
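
A rough numerical sketch of this point follows. It assumes two invented subgroups lying along the same line, one with narrow scatter and one with wide scatter, so that the pooled data are gourd-shaped; none of the figures come from actual observations.

```python
import random
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

rng = random.Random(3)  # fixed seed; hypothetical data
# Subgroup with narrow scatter around the line y = x.
tight_x = [rng.uniform(0, 10) for _ in range(500)]
tight_y = [v + rng.gauss(0, 0.5) for v in tight_x]
# Subgroup with wide scatter around the same line.
loose_x = [rng.uniform(10, 20) for _ in range(500)]
loose_y = [v + rng.gauss(0, 6.0) for v in loose_x]

r_tight = pearson_r(tight_x, tight_y)
r_loose = pearson_r(loose_x, loose_y)
r_pooled = pearson_r(tight_x + loose_x, tight_y + loose_y)
print(f"narrow-scatter subgroup: r = {r_tight:.2f}")
print(f"wide-scatter subgroup  : r = {r_loose:.2f}")
print(f"pooled                 : r = {r_pooled:.2f}")
```

The pooled r falls between the two subgroup values and describes neither of them well, which is the case for segmenting the data before measuring.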

(3) Specificity of an Index. We must never extend the generalization
beyond the specific context in which the data are found. It would be an
error to overgeneralize in the following manner: “The correlation between
delinquency and monthly rentals is -.59.” It would be more accurate to
say: “The correlation between rentals and delinquency, in a particular
city, at a given period, and under given conditions was -.59.” If, for
example, new definitions of delinquency were to be imposed, or if
rent-control laws were modified, or if any one of a dozen relevant factors in the
social setting were modified, the correlation would have been altered. It
is this circumstance that should warn us against quoting or applying
obsolete correlations as though they pertained to a current setting.
Correlations, like the folkways of society, are culture-bound. Extrapolations are
therefore always hazardous.

This caution will probably elicit considerable methodological discussion
on the cumulative character of social research, as compared to the
cumulative potentialities of physical laboratory science, where the contextual



satisfies such ideal assumptions. This being the case, why not invent
procedures which are grounded in “reality” rather than in ideal conditions
which admittedly do not exist? The answer is that it is impossible to
invent a formula for every conceivable set of contingencies or for every
possible gradation in their forms. Just as the law of falling bodies is
standardized for a vacuum, so the correlation formulas are standardized
for ideal conditions which, in fact, do not exist. What is a still greater
violation of nature is the necessity of segmenting and segregating the
objects of study, and abstracting them from the raw context in which
they normally operate.

Considered from this point of view, a formula is a human contrivance
like a machine, which is supposed to work for us. We may ask it to work
on materials for which it is not intended. This would be, of course, a
blunder on our part, ascribable to deficient experience and knowledge.
Or, what is more likely, we may put in materials for which it is only
moderately well suited; or again, we may assign it a task for which it is eminently
well suited.

What we are saying is that precision is relative to our needs, our
interests and the potentialities of our measuring tools, and that occasionally
it may be a matter of selecting one of several alternative formulas. Thus,
r and η are to a slight degree interchangeable, depending on the degree of
tolerance on the part of the worker; ρ is similarly acceptable for r; φ is
another form of r with possible alternation between them.

Convertibility of Indexes. At times, we may have left the impression that
the various formulas were mutually exclusive; however, it has also been
intimated that some of them possess certain elements in common. Thus,
the φ coefficient may be viewed as the r formula tailored to dichotomous
data; η² and r² are identical measures when the relation is perfectly linear;
ρ, based on ranks, is also a derivative of Pearsonian r. Such reflections
testify once again to the ingenuity exercised in the invention as well as in
the implementation of formulas in our effort to penetrate the secrets of
nature.
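
Two of these convertibilities can be checked by direct computation. The sketch below assumes the customary textbook forms of φ for a fourfold table and of ρ for untied ranks; the cell frequencies and ranks are invented, and the function names are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def phi_coefficient(a, b, c, d):
    """Phi for a 2 x 2 table with cells a (1,1), b (1,0), c (0,1), d (0,0)."""
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

def spearman_rho(rank_x, rank_y):
    """Rank-difference formula for rho, valid for untied ranks."""
    n = len(rank_x)
    sum_d2 = sum((p - q) ** 2 for p, q in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n * (n * n - 1))

# Invented fourfold table, expanded into one 0/1 pair per case.
a, b, c, d = 30, 10, 15, 45
x = [1] * (a + b) + [0] * (c + d)
y = [1] * a + [0] * b + [1] * c + [0] * d
phi = phi_coefficient(a, b, c, d)
r_on_dichotomies = pearson_r(x, y)

# Invented untied ranks for six cases.
rank_x = [1, 2, 3, 4, 5, 6]
rank_y = [2, 1, 4, 3, 6, 5]
rho = spearman_rho(rank_x, rank_y)
r_on_ranks = pearson_r(rank_x, rank_y)

print(f"phi = {phi:.4f}, r on 0/1 codes = {r_on_dichotomies:.4f}")
print(f"rho = {rho:.4f}, r on ranks     = {r_on_ranks:.4f}")
```

In each pair the two figures should agree to within rounding, since the special formulas are algebraic shortcuts for the Pearsonian r applied to dichotomies and to ranks respectively.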


Pitfalls in the Interpretation of Correlations
It has been repeatedly emphasized that a correlational measure should
not necessarily be taken at its face value. For example, a Pearsonian r
of zero does not prove the absence of a substantive relation between the
two series of data: we may have employed the wrong formula; an r of .3
might be substantially raised or lowered by subclassification. How can
we ever be sure of the validity of the index? The answer is, of course,
that we may never be absolutely sure. It is necessary to be pragmatic and
interpret the results in terms of relevant knowledge. At the same time,

under study, although his memory and understanding may accommodate
a very large number of variables, depending upon his experience,
intelligence, and opportunities for observation. It is therefore highly probable
that on many occasions an experienced worker will, in a certain sense,
know much more than his formula does. He can attend to a larger number
of variables, but probably not quite so accurately as can his mathematical
tools. A good worker will always combine and blend the potentialities
of his mathematical tools with the subtlety of his own imagination.

What Is a “High” Correlation? A correlation is judged “high” or “low”
according to the same standards as any other phenomenon is so judged:
either in absolute terms, or relative terms. In absolute terms, a correlation
is high or low as it approaches its possible numerical limits. Hence, .9
is high, and .2 is low. However, in terms of the norms of human
expectation and demand, absolute terms are not very meaningful. A correlation
of .7 would be considered very low for test-retest problems, where the
reliability of the test is in question, but very high for the correlation
between IQ and grades. Familiarity with the scientific norms prevailing
in given situations offers the only guide for useful judgments.

Are Low Correlations Acceptable? Is a zero correlation a satisfactory result
in sociological inquiry? One may assert that science is impartial and is
not interested in any particular findings, and hence a zero correlation is as
satisfactory in principle as a high correlation. It is true, the scientific
attitude is not biased in favor of any particular finding. However, there
is a difference between a “finding,” and a scientifically acceptable
correlation measure which answers in part the question which prompted the
investigation in the first place. Science has as its objective the completest
possible explanation; a zero correlation explains nothing.

This is not to say that zero correlations may not have their uses, for 
they may explode long-cherished myths and eliminate tentative working 
hypotheses. But constructively, they represent failure, for they add
nothing to human knowledge. Humanly speaking, zero correlations repre- 
sent statistical independence; they permit no prediction beyond chance, 
which is the ultimate of ignorance, rendering adaptation or survival diffi- 
cult or impossible. In a random world, we could never learn by experience. 

It is one thing to say that science is not biased in favor of any specific 
finding (that is, any specific prediction) but another to state that science 
is not interested in prediction at all (that is, accepts zero correlation). 

Questions and Problems

1. Distinguish between nonsense and spurious correlation.

2. Discuss in what manner knowledge of subject matter will influence the
interpretation of obtained correlation measures.



circumstances are more constant. The potential non-comparability of
social data is admittedly a persistent barrier to progress in the social
sciences. The United States Census, one of the most important sources
of quantitative data on a large scale, has modified the definitions of family,
employment, occupations, urban areas, and other basic concepts at
intervals of ten years, which is one formidable example of the relevance of this
principle of specificity.

(4) Correlation Between Synoptic Measures. The data for correlations
are frequently in the form of synoptic measures, such as means and
percentages. The mean, for example, has the familiar effect of automatically
eliminating all variation, and thereby reducing error of an undetermined
amount. Since variation is a form of error, its reduction or elimination
will always tend to raise deceptively the coefficient of correlation over
what it would have been had the original itemized data been used. Special
care must therefore be exercised in interpreting correlations of synoptic data.

Thus, percentages may be a source of misinterpretation. The
percentage of illiterates by states, correlated with the percentage of Negroes,
would yield a rather high index. It is, however, conceivable that a large
proportion of illiterates in each state are rural white, and this would tend
to contradict the impression of the correlation between race and illiteracy.
A more forthright procedure would be to cross-tabulate individuals by
race and literacy. If such detailed data are not available, it still devolves
upon the investigator to exercise imaginative restraint in his interpretation.
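
The inflating effect of synoptic measures can be seen in a simulated case. The sketch below assumes forty hypothetical areas of fifty individuals each, with a modest relation among individuals; every figure is invented, and the variable names are labels for this illustration only.

```python
import random
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

rng = random.Random(4)  # fixed seed; hypothetical data
groups, per_group = 40, 50
xs, ys, mean_xs, mean_ys = [], [], [], []
for _ in range(groups):
    level = rng.gauss(0, 1)  # characteristic shared by everyone in the area
    gx = [level + rng.gauss(0, 2) for _ in range(per_group)]
    gy = [level + rng.gauss(0, 2) for _ in range(per_group)]
    xs.extend(gx)
    ys.extend(gy)
    mean_xs.append(sum(gx) / per_group)  # averaging removes within-area variation
    mean_ys.append(sum(gy) / per_group)

r_individuals = pearson_r(xs, ys)
r_means = pearson_r(mean_xs, mean_ys)
print(f"r from itemized individual data: {r_individuals:.2f}")
print(f"r from area means              : {r_means:.2f}")
```

The correlation between means should come out far higher than the correlation among the individuals from whom the means were computed, although both rest on the same observations.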

(5) General Sociological Considerations. All general precautions
observed in ordinary statistical procedures must also be taken in correlational
computations. Uniform definitions of terms, units of measure, and care
in collection of the data are prescribed for statistical workers of every type.
But the specific pitfalls vary for different types of computations.
Correlations, especially when used for comparative purposes, may be entirely
misleading when definitions of items are vague and unreliable. Thus, a
compilation of crime rates in different areas of the United States, where
definitions of crime as well as court procedures vary, might be fairly
meaningless unless such definitions were standardized. A correlation
between monthly rentals and delinquency rates in places where rent control
prevails would yield a different measure from what we would find in an
area or chronological period where rents are determined by normal
competitive economic processes. It is clear that no correlation is “terminal”
but is susceptible to continuing analysis, refinement, and extenuating
interpretation. To carry out such functions requires not only a competent
statistician in the mathematical sense, but also a worker competently
trained in the intricacies of the social forces operating in the area of
observations.

An investigator must never lose sight of the obvious fact that a correlation
formula usually reflects only a few of the variables in the context



Sampling


The Sampling Attitude 

The Purpose of Sampling. The broad aim of statistics is to describe and
summarize mass phenomena like births, deaths, and income, and their
interrelationships. However, it is often necessary or practicable to base
such description on a fraction of the total aggregate, and sometimes an
exceedingly small one at that. Such an expedient may be, and usually is,
quite satisfactory. It is astonishing how effective a well-selected
fragment can be: a small snippet from a bolt of cloth; a few drops of blood
from the patient’s total supply; a few thousand survey votes, by which we
describe the political intentions of millions of voters. Such procedure
is standard practice in everyday social and economic life, as well as in the
branches of scientific activity. In instances of this kind, when the data
are partial rather than complete, and when they are used to characterize
the entire set, we call the fragment a sample, and the total aggregate a
universe, or population. We name a specified value of the universe, such as
the mean, a parameter, and its counterpart in the sample we term a statistic.
The objective of sampling is, therefore, to draw an inference about the
parameter, which is unknown, from the sample statistic which is observed.
This process of generalizing in a prescribed manner from sample to universe
has come to be known as statistical inference.
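
The vocabulary of universe, sample, parameter, and statistic may be fixed in mind by a small sketch. It assumes an invented universe of 100,000 incomes generated from a lognormal distribution, which stands in for a real population only for purposes of illustration; in practice the parameter would of course be unknown.

```python
import random

rng = random.Random(11)  # fixed seed so the illustration is repeatable

# Hypothetical universe: 100,000 family incomes (invented figures).
universe = [rng.lognormvariate(10, 0.5) for _ in range(100_000)]
parameter = sum(universe) / len(universe)   # the universe mean

# A simple random sample of 1,000 sampling units, drawn without replacement.
sample = rng.sample(universe, 1000)
statistic = sum(sample) / len(sample)       # the sample mean

print(f"parameter (universe mean): {parameter:,.0f}")
print(f"statistic (sample mean)  : {statistic:,.0f}")
print(f"relative discrepancy     : {abs(statistic - parameter) / parameter:.1%}")
```

A sample amounting to one per cent of the universe should reproduce the mean quite closely here, and the size of the remaining discrepancy is what the theory of statistical inference undertakes to gauge.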

Although statistical inference as a formal quantitative technique is
principally an achievement of the twentieth century, its underlying
motivation is as ancient as mankind itself. In his continuing effort to adjust to
the environment, man has necessarily been obliged to treat an isolated
experience as typical of a larger system in order to profit by it. Once stung
by a bee, or burned on a stove, he is likely to view such objects as persisting
hazards to be avoided. On the same principle, but in a more deliberate
manner, he samples one peach from the bushel, plugs a watermelon, or
gives the new car a trial run.



3. Illustrate, from your own experiences, if possible, the concept of nonsense
correlation.

4. Give several illustrations of spurious correlation, and show how the spurious
element may be isolated.

5. Itemize the different conceptions of correlation, and show how they are
interrelated.

6. Define and illustrate a concealed factor, and show how it may deceive an
observer.

7. Explain the several principles which justify the use of various measures of
correlation.

8. Define a synoptic measure, and show how it may create problems in the
interpretation of correlation.

9. Explain why zero correlations are unsatisfactory from the standpoint of the
objectives of science.






couple in a large city, he may find such a program impracticable to execute 
because of limited financial resources. He will then be content with a 
carefully selected sample. But this is not an unusual restriction. Research 
workers in general expect to arrive at valid generalizations on the basis of
sample materials, presumed to be representative of some wider domain. 
Thus, the anthropologist interrogates a few native informants in order to 
reconstruct the entire pattern of a culture; the U.S. Children’s Bureau 
relies on a sample of juvenile courts in order to establish national trends 
in delinquency. 

Moreover, there are times when sampling is not only practically
advantageous, but an almost inescapable necessity. When we destroy an object
in the very act of measuring one of its characteristics, we must either
sample only a few or destroy them all. This goes by the name of destructive
sampling. For example, it is impossible to measure the life span of an
electric light bulb without ending its life; consequently, to approximate
the average life span of a large lot, the manufacturer necessarily resorts
to a small sample. Analogously, the physician cannot afford to sample
more than a few drops of blood without causing undue discomfort in his
patient.

But destructive sampling may sometimes be abandoned, if we are
unwilling to tolerate the destruction of even a single item. Where human
life is jeopardized, we are not willing to risk the loss of a single sample case.
And even when the sampling is not harmful, social considerations inhibit
us from freely carrying out experimentation on human beings. It is only
when an occasional life convict or conscientious objector voluntarily
submits to a possibly harmful experimental treatment that public opinion
permits destructive sampling to be carried out on human beings. In the
usual case, such sampling is transferred to dogs and monkeys, who serve
as stand-ins for man.

Sample materials are thus the stuff of which scientific generalizations
are made, but the sampling method involves more than mere recognition
of that principle. Modern sampling practice is distinguished by (1) its
emphasis on the well-defined universe, (2) the random selection of cases,
and (3) the estimate of the reliability of the sample statistic, that is,
how closely it probably conforms to the unknown parameter. These
concepts form the basis of the discussions in this and the following chapter.


The Statistical Universe

Definition of Universe. In colloquial speech, the term universe suggests
the entire Creation. But in statistical language, it refers merely to a
totality of values possessed by elements having a well-defined common
characteristic which determines membership in the set. Such sets are
also termed statistical populations. We may cite as universes: the incomes



Moreover, fresh samples of experience continually stimulate us to
reexamine our previous inferences. Thus, the success of one woman in
Congress will cast doubt on the previously held opinion that women are
politically incompetent; the grade records of Negroes in northern schools
will force a revision of the stereotyped belief that Negroes possess an
inferior mentality. Thus, we modify our premises in response to the
unfolding sample evidence. In the language of statistics, we afford our
initial hypotheses an opportunity to be revised or nullified.

While every man thus reasons informally from part to whole, statistical
inference is much more rigorous than such mere folk practice. Sampling
has come to be a body of technical procedures which must be deliberately
applied and strictly adhered to if its goals are to be fully realized. Thus,
the statistical universe must be well defined, and the sample must be
properly drawn, two basic operations which are much more involved
than appears on the surface. In fact, the sampling system, as it has evolved
in the twentieth century, is essentially a technical accounting device to
rationalize the procurement of information. By this criterion of efficiency,
it is always preferred to complete enumeration when it yields data of
requisite accuracy, since such a procedure is obviously more economical
in a busy world of limited resources.

Further, the logic of sampling is reinforced by the generally accepted
view of nature as being orderly and predictable. The discipline of statistics
has not always enjoyed such a favorable climate of opinion, especially
when applied to human behavior. In the mid-nineteenth century, Quételet
and Buckle were denounced as materialists because they drew inferences
from statistical records on the recurrences of crimes, suicides, and other
human actions that ran counter to the then current doctrine of free will.
It is therefore no accident that the sampling method has been cultivated
most intensively and that the sampling mentality has flourished in the
scientifically-minded, efficiency-oriented rational culture of the modern
Western world.

Advantages of Sampling. The collection of sample data naturally requires 
less time and effort than does the compilation of complete data. Hence, 
surveyors of American public opinion universally avail themselves of 
samples of respondents, since results based on time-consuming total enu- 
merations would be obsolete before they could be tabulated and published. 
Similarly, since 1940, sampling has been extensively employed by the
U.S. Bureau of the Census in its decennial enumerations to provide more
promptly detailed descriptions of various characteristics of the American 
population. 

Since sampling is also less costly, it may be quite feasible when the
financial burden of full coverage is prohibitive. Although a free-lance
sociologist, for example, may believe it desirable to interview every divorced

verse contains a countable number of elements. It may be relatively
small, as, for example, all students enrolled in a particular college in a
given year; or it may be relatively large, as all college enrollees in the
United States. But the infinite universe consists of an endless number of
elements, such as an unlimited number of penny tosses or other
experimental trials. It is thus purely conceptual, and may even seem
metaphysical to the finite mind. And yet it is often heuristically postulated in
statistical inference. For one reason, an infinite population permits the
reliability of the sample findings to be more simply evaluated by formula
than does a finite population. Consequently, we resort to the assumption
of an infinite population whenever the size of a finite population is large
enough to justify it. Additionally, it may be invoked when it is not
reasonable to limit the size of the universe at all, as in the case of infinitely
repeatable experimental trials. The laboratory dog may die after an
experimental injection; but the scientist does not restrict his
generalization to the dead dog. His ultimate interest lies in the potentially endless
succession of experimental trials ideally performed under identical
conditions. Onto this infinitely large universe he fastens his generalization.
Analogously, but at considerably greater risk, the social scientist may
conceptually extend his findings in one or a few cases to the hypothetical
infinite class of presumably identical events. One family, one community,
one culture, one bureaucracy: each in its turn serves as a prototype to
which all future occurrences of the same general class are presumed to
conform.
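
Why a large finite universe may be treated as infinite can be shown with a short computation. The sketch below uses the standard error of a sample mean together with the usual finite population correction, both taken from general statistical references rather than from this chapter; the values of `sigma`, `n`, and `N` are arbitrary illustrations.

```python
from math import sqrt

def se_infinite(sigma, n):
    """Standard error of the mean for a sample of n from an infinite universe."""
    return sigma / sqrt(n)

def se_finite(sigma, n, N):
    """Same, for sampling without replacement from a finite universe of size N."""
    return se_infinite(sigma, n) * sqrt((N - n) / (N - 1))

sigma, n = 10.0, 100  # hypothetical universe standard deviation and sample size
for N in (1_000, 10_000, 1_000_000):
    ratio = se_finite(sigma, n, N) / se_infinite(sigma, n)
    print(f"N = {N:>9,}: finite-universe SE is {ratio:.4f} of the infinite-universe SE")
```

As the universe grows relative to the sample, the ratio approaches one, so that the simpler infinite-universe formula sacrifices very little accuracy.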

(3) The universe which is actually sampled (the sampled universe) will
not always coincide with the universe on which our sights are fixed (the
target universe). The target universe represents ideally the territory we
intend our generalizations to cover, the domain to which we eventually
apply our sampling knowledge. Our ultimate interest may lie, for instance,
in the patterns of adjustment of all married students on campuses in the
United States, but for practical reasons it may be necessary to restrict
the sampling to the available couples on a particular campus, and these
couples then become the sampled universe. When a mailed questionnaire
is returned by less than 100 per cent of the sample, as is usually the case,
we may conceive of the target universe as the complete mailing list from
which the sample was selected, and the sampled universe as all persons
on that list who would theoretically return that questionnaire if given an
opportunity to do so. Thus, the concept target universe may be applied
broadly to an idealized extension of the sampled universe, or narrowly to a
universe in which a fraction of the units are for one reason or another
inaccessible to measurement, such as refusals or not-at-homes.

Strictly speaking, statistical inference should be rigorously limited to
the sampled universe; and yet the social analyst can hardly refrain from
speculating about his target universe. No research study in the social



of all American families in 1960, the opinions of all college students on the
subject of war, or the social-status ratings of all residents in Yankee City.
From these examples, we discern that the statistical universe may be
conveniently conceived of as being dualistic in nature: (a) one dimension
consisting of the units (e.g., families) which are actually sampled and which
are therefore called sampling units; (b) the other dimension being the
sampling trait (e.g., income) possessed by the sampling units, which is
subsequently manipulated statistically.

It is the sampling units that are physically selected: families with
incomes, farms with acres, students with opinions, workers with
occupations. Further, the well-defined criterion which determines eligibility for
such selection here applies to the sampling units rather than to the traits.
Thus, a population of families consists of all human groups that satisfy
the working definition of a family.

On the other hand, it is the variable properties of the sampling units
that command our ultimate interest. We are seldom if ever interested
in families qua families; rather our interest will lie in one or more of their
relevant traits: income, nationality, size, social status, religion, and so on.
It is variables such as these that are subjected to statistical measurement
after the sampling units have been drawn and their characteristics
determined.

When, for any reason, it is unnecessary to identify explicitly both aspects
of the universe, we may quite properly employ elliptical statements such
as “the population of college students” or “the universe of attitudes.”
But such abbreviated statements omit the sampling trait of attitude (of
the college students) in the first instance, and the sampling unit, the college
student (whose attitudes are polled), in the second instance. Although
terminology varies among statisticians, we here conceive of the “universe”
as comprising both the sampling unit and the corresponding sampling
trait.

Universe Classification. Statistical universes differ in various ways and
may therefore be classified according to various criteria. Three of the
more important classifications are here briefly set forth to amplify the
sampling concept and refine its implementation:

(1) The universe will be qualitative or quantitative according to whether
the traits of which it is composed are attributes or variates. This familiar
distinction implies that the statistical description of the universe will take
the form either of arithmetic averages or frequency counts and percentages.
For example, the incomes of American families may be represented by the
mean, whereas the occupations of American workers can only be grouped
and counted and expressed as percentages of the total.
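
The contrast between the two kinds of summary can be sketched in a few lines of Python. The figures below are hypothetical and serve only to illustrate the distinction; they are not data from this text.

```python
from collections import Counter
from statistics import mean

# Hypothetical illustrative data (an assumption, not taken from the text).
family_incomes = [4200, 5100, 6100, 7300, 9800]        # a variate
worker_occupations = ["farmer", "skilled", "skilled",
                      "white collar", "unskilled"]      # an attribute

# Quantitative universe: summarize with an arithmetic average.
mean_income = mean(family_incomes)

# Qualitative universe: group, count, and express as percentages of the total.
counts = Counter(worker_occupations)
percentages = {occupation: 100 * count / len(worker_occupations)
               for occupation, count in counts.items()}

print(mean_income)
print(percentages)
```

An average of the occupation labels would be meaningless, which is why the attribute is reduced to counts and percentages instead.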

(2) The universe may be finite or infinite, depending on whether the
sampling units are finite or infinite in supply. By definition, a finite uni-


using persons enrolled in English Composition, as this subject is required
of all. Again, reasonable enough on first thought, this alternative will
seem less satisfactory when we realize that only underclassmen are
enrolled; mature upperclassmen would have no opportunity to be included
in the sample.

Many other possibilities will readily suggest themselves in the search
for representative coverage. Thus, we might consider canvassing the
men's dormitories. But this scheme rather glaringly omits women, and
so would have to be modified to include women's residence halls as well.
Even with this modification, the plan is rather obviously unsound: it
makes no provision for students housed in fraternities or for those living
at home. Apparently, other plans will have to be tried until an adequate
one is developed.

The general shortcoming of the aforementioned alternatives will be
recognized by even the casual reader: they do not afford each sampling
unit an equal opportunity of being selected. Ideally, of course, our
sampling procedure should give to each unit in the population such an
opportunity. Methods which meet that criterion are named random
sampling procedures; methods which offer no such assurance may be
conveniently labeled non-random procedures. When such equal
opportunity of selection is not provided for by the sampling technique, it will
generally be impossible to vouch for the representativeness (reliability)
of the sample, and consequently it will be impossible to generalize
confidently from the sample back to the universe. The fulfillment of the
purpose of sampling is jeopardized.

Non-random procedures are therefore seldom considered to be ideal;
nevertheless, they are often justifiably resorted to in social research
because of practical necessity. Beginning with the least useful, several
are presented here in ascending order of utility so that the student will
be familiar with their respective merits and shortcomings.


Non-random Procedures

Haphazard Sampling. The acceptance of whatever cases one
fortuitously happens to encounter, without any consideration whatsoever for
their degree of representativeness, may be termed haphazard sampling.*
This practice is exemplified by the old-fashioned straw vote in which
citizens are accosted in the street to ascertain their voting intentions.
From these straws in the wind, the election “forecast” is made. The
obvious objection to such casual procedure is that the man-in-the-street
simply is not representative of the total electorate. Similarly, in a
survey of student opinion on the propriety of final examinations, those
* Synonymous concepts that appear in statistical writing include accidental sampling
and convenience sampling.




sciences would ever be made if its findings could not be imaginatively
extrapolated beyond the limited universe from which the sample has
been derived. Such speculations are both justifiable and desirable,
provided that their tentative nature is recognized and understood. A
sampling study of mental health and social class in a New England community
is of significance primarily for the light that it sheds on the relation between
social structure and personality deterioration in other American
communities. And so the investigator quite naturally probes his sample materials
for their wider generality and makes the most of his costly data. In
projecting his findings onto the vaguely defined target universe he necessarily
proceeds without benefit of strict reliability procedures. Yet he may be,
and usually is, engaged in fruitful and necessary scientific activity.
Nevertheless such a liberal statistical morality, which is practiced by even the
most thoughtful and productive social scientists, is by no means license
for irresponsible and sloppy statistical generalizations. It should be
permitted only to experienced investigators.


Sampling Procedures 

Problem of Sampling. The process of sampling is in its scientific sense
a technical operation which must be conducted according to standard
prescriptions in order to secure all of its benefits. In fact, costly social
investigations have sometimes been severely blemished because the
sampling tactics were crude and inadequate. Because of the admitted
difficulties in sampling human populations, the discussion of the theory and
practice of sampling must always occupy an important place in the
domain of social statistics.

Consider, by way of illustration, the task of sampling a set of college
students (a common assignment for majors in journalism or sociology)
with a view to generalizing about the entire student body.
Disregarding momentarily the kind of data sought (attitude toward
communism, number of dates per week), how may we obtain a sample
which will do justice to the student body? This is no simple task. The
sampler might give way to the first impulse to take all persons enrolled
in elementary sociology, a rather attractive possibility since such classes
include a wide variety of students and, in addition, are easily reached
and manipulated for experimental or survey purposes. However, on
second thought, he will realize that college courses are almost sure to
exert some selective influence among students. Thus, sociology is likely
to attract persons whose primary interest is in social issues, but may
hold no appeal for those principally interested in the physical world.
In restricting the sampling to sociology students, there is therefore a
danger of excluding certain types of individuals who are negatively
selected by this subject. To forestall this outcome, we might propose


nighthawks whom we happen to find breakfasting in the coffee shop at
11 A.M. will be something less than a fair cross-section of the entire
student body. Such crude chunks of data are “samples” only in the loosest
sense of that term. When seriously used, they constitute an unflattering
reflection on the sophistication of those who resort to, and accept, such
data.

Availability Sampling. Although fully aware of the limitations of
non-random sampling, the experienced social scientist will sooner or later
realize that some form of it is often the only alternative to abandoning
the inquiry. In many instances, he may be required therefore to seize
whatever opportunity is available. The Kinsey survey of sexual
behavior in the United States male population was severely criticized because
it was based to a large extent on data provided by solicited subjects who
made themselves available. It was plausibly contended that persons
who volunteer to provide information about their sexual behavior are
likely to differ significantly from persons who decline to be interviewed.
But the investigators considered the acceptance of volunteer respondents
as a pragmatic solution, rather than the preferred procedure. It was
their reasonable assumption that many random selectees could not be
induced to relate their sex histories, and that any possibility of obtaining
a completely random sample was precluded.

For similar reasons, other types of sociological studies must often rely
on available opportunities. For example, it may be impossible for the
behavioral scientist to sample all workers in a given industry, but yet
possible to observe workers in a local plant; impossible to poll a group
selected from all school children in the state, but possible to poll those
attending various local schools. Social psychologists are likely to find
little humor in the sarcastic remark that their broad “universal”
generalizations are founded on experiments on college sophomores. Above all,
they would prefer to study all kinds of individuals (old as well as young,
non-college as well as college) in formulating the laws of social
learning. But it is often a question of sampling the available students or
otherwise abandoning the project. Hence, they must convert a captive
audience of students into a “valid” sample. Similarly, research on
delinquency is almost always based on the available delinquents: boys
on probation, boys in court, or boys in the industrial school. Regardless
of how desirable it might be to study a group selected from all
delinquents, the unapprehended as well as the apprehended, such
comprehensive sampling is for obvious reasons out of the question.

It would therefore be pedantic to deny the uses of available
opportunities, even though they do not yield ideal data. Social scientists, like
everyone else, must often content themselves with compromises.
Notwithstanding their shortcomings, availability samples do yield signifi-

in 25. How is this to be interpreted? The layman would reply: “Since
the human sampler cannot influence the drawing, each guest has the
same chance.” But the statistical interpretation lies in the very
definition of probability, which applies to mass phenomena or conceptually
repeatable trials. In this instance, “same probability” refers to the
expectation that each name would be drawn an equal number of times
if identical drawings were conducted indefinitely. Of course, an empirical
demonstration of innumerable drawings is not undertaken; nevertheless,
it is tacitly assumed that the a priori expectation of equal occurrences
would eventually be confirmed. This principle is intuitively recognized
by the losing guests who confidently console themselves with the bromide,
“better luck next time.”
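
Although the "innumerable drawings" are never carried out by hand, they are easily imitated on a computer. The following is a minimal sketch, assuming Python's pseudo-random generator as a stand-in for a well-mixed hat; the function name and the number of repetitions are illustrative choices, not part of the text.

```python
import random
from collections import Counter

def simulate_door_prize(n_guests=25, n_drawings=100_000, seed=0):
    """Repeat the door-prize drawing many times and return each
    guest's relative frequency of winning."""
    rng = random.Random(seed)
    wins = Counter(rng.randrange(n_guests) for _ in range(n_drawings))
    return {guest: wins[guest] / n_drawings for guest in range(n_guests)}

freqs = simulate_door_prize()

# Every guest's share of wins should lie close to 1/25 = 0.04.
print(min(freqs.values()), max(freqs.values()))
```

The relative frequencies are not exactly equal in any finite run; the claim of equal probability concerns what they tend toward as the drawings are continued.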

Of course, equal probability of selection cannot be assumed if some
cases are less accessible than others or if the selection mechanism is
functioning imperfectly. Either of these related contingencies will defeat
the aim of random sampling and result in biased sampling. Thus, in
selecting a sample of 10 slips from a receptacle containing 200,
randomness would be precluded if the slips were unequal in size, shape, or weight;
or if they were carelessly mixed so that the last names dropped into the
hat would have the best opportunity of being drawn. But the outward
appearance of the extracted sample would never reveal such procedural
flaws, or biasing factors. The composition of a non-random sample is
visibly no different from that of a sample of the same size selected by
random procedures; the sampling operation leaves no telltale mark.
Thus, if we were to come upon two different scatters of ten coins each
strewn on two tables, the one all heads and the other showing six heads
and four tails, no amount of visual inspection would tell us whether either
or both of them had been carefully laid down or whether they had been
tossed at random. Nevertheless, the observer will intuitively conclude
that the first had probably been laid down and the second had been
arbitrarily tossed. Why? Because the probability of 10 heads together is
so small, and that of the division of 6 and 4 much greater. In fact, the
probabilities compare as 1 to 210. Still, he can never be certain. Similarly,
in the case of our hostess, the name on the winning slip can never betray
whether the drawing was honest or biased.
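
The ratio of 1 to 210 can be checked directly. The sketch below assumes ten independent tosses of a fair coin and takes "the division of 6 and 4" to mean exactly six heads and four tails; the helper name is illustrative.

```python
from fractions import Fraction
from math import comb

def prob_heads(k, n=10):
    """Probability of exactly k heads in n tosses of a fair coin."""
    return Fraction(comb(n, k), 2 ** n)

p_all_heads = prob_heads(10)   # 1 favorable arrangement out of 1,024
p_six_heads = prob_heads(6)    # 210 favorable arrangements out of 1,024

ratio = p_six_heads / p_all_heads
print(p_all_heads, p_six_heads, ratio)
```

Both outcomes are possible under random tossing; the second is simply 210 times as probable as the first.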

However, a suspicion of biased sampling will be aroused by the
subsequent discovery of a marked discrepancy between the sample value
and the true value of the universe, when and if the latter becomes known.
The now classic example of such a melancholy outcome is the notorious
Literary Digest Presidential pre-election poll of 1936. While Roosevelt
obtained approximately 60 per cent of the popular vote in the actual
election, his percentage in the Literary Digest sample of over 2,000,000
respondents was only 40 per cent, a difference of 20 percentage points!
This discrepancy in such a large sample was symptomatic of a gross



ject to human bias and error. And even when the universe is in plain
sight, it is probable that the observer will misjudge to some extent the
representativeness of a sample. To demonstrate such bias, the English
statistician Yates once requested 12 persons to select three samples of
20 items each from a collection of 1,200 stones, each sample to represent
as accurately as possible the size distribution of that experimental
universe. Although the observers were free to view the collection at their
leisure, still there was a consistent tendency to exaggerate the average
size of the stones and to minimize their variation. In short, there was
a constant error in judgment. We cannot be sure that analogous errors
did not bias the “Fa” sample. Because of the practical certainty of
human bias, judgment sampling must therefore be applied with great
caution. But whatever the limitations of non-random samples may be,
the techniques of descriptive statistics must obviously be competently
applied in order to assure their maximum utility.

Random Sampling Procedures

Definition of Random Sampling. If available samples are fallible, and
expert judgment is not to be trusted, then what are the factors that
should determine the composition of the sample? The answer is: chance
factors. However, in view of the low regard in which chance factors
are usually held as guides to action, it is surprising that we should so
willingly lay aside our cumulated knowledge and experience and go to
the other extreme, permitting blind chance to determine the choice of
the sample. In most human situations, we wish to eliminate chance,
since it disturbs our predictions. Nevertheless, the ideal sampling
procedure is one in which the drawings are affected by impartial chance
factors alone, with the result that one item in the universe is as likely
to be included in the sample as another. No item is accorded a
preferential advantage. In fact, random sampling is defined as a procedure
that provides an equal opportunity of selection to each unit in the
population. We eliminate personal bias by elimination of the person, and
thereby permit only the play of impersonal chance forces.

There is nothing esoteric about random sampling; most persons are
familiar with a few homemade routines that assure an impartial
randomized selection. The thoughtful hostess wishes every guest to have an
equal opportunity to win the door prize, regardless of wealth or
friendship; the contest is to be perfectly democratic. Accordingly, everyone
is requested to write his or her name on a blank slip and drop it into the
hat. After the last arrival, the slips are thoroughly mixed, and one
name is drawn and declared the winner. Although only one person can
win the door prize, we still say that the probability of winning was
identical for all. Among 25 guests, the probability of winning would be 1


For finite universes and samples of any size, this probability may be
expressed in terms of the now familiar combinatorial formula:

\[ \Pr(\text{Given Sample}) = \frac{1}{\binom{N}{n}}, \qquad \binom{N}{n} = \frac{N!}{n!\,(N-n)!} \]

Applying these formulas to the illustrative data given above, we have

\[ \text{Total Possible Samples} = \binom{5}{2} = \frac{5!}{2!\,3!} = 10 \]

\[ \Pr(\text{Given Sample}) = \frac{1}{10} \]

which agrees with the previous results.

Sampling by Random Digits. A rudimentary technique for carrying out
simple random sampling has already been set forth: (1) represent units
on slips; (2) thoroughly scramble; and (3) draw the required number
of slips. However, the drawing need not be laboriously carried out in
this manual fashion. It will usually be inefficient to do so, especially
when the population is large. It is much more practicable to substitute
a table of randomly ordered digits for the shuffled numbered slips, a
common procedure of research workers. This technique necessarily
requires that we number serially all of the units in the population from 1
through N and then draw from the corresponding table of random digits
as many different numbers (combinations of digits) as there are cases to
be included in the sample. The cases whose serial numbers correspond
to those drawn from the table constitute the sample. Table IV of the
Appendix presents such a list of random numbers.
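
The routine of reading a random-digit table can be imitated in code. The sketch below is only an illustration: it assumes a pseudo-random generator in place of the printed table, and the function name is hypothetical. Digits are read in groups as wide as N, and any group that falls outside 1 through N, or that repeats a number already drawn, is passed over.

```python
import random

def draw_from_random_digits(N, n, seed=None):
    """Select n different serial numbers from 1..N by reading
    successive groups of random digits, as one would from a table."""
    if not 0 < n <= N:
        raise ValueError("need 0 < n <= N")
    rng = random.Random(seed)      # stand-in for the printed table
    width = len(str(N))            # digits per group, e.g. 3 for N = 200
    chosen = []
    while len(chosen) < n:
        digits = "".join(str(rng.randrange(10)) for _ in range(width))
        number = int(digits)
        # Skip groups outside the numbered population and repeats.
        if 1 <= number <= N and number not in chosen:
            chosen.append(number)
    return chosen

sample_ids = draw_from_random_digits(N=200, n=10, seed=42)
print(sample_ids)
```

The cases bearing the serial numbers in `sample_ids` would constitute the sample.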


Other Random Sampling Procedures
Simple random sampling is the most primitive and unrestricted
selection procedure and most clearly exposes the essential operation of
randomness. Being free of procedural modifications which are often made
necessary by practical circumstances, it is conceptually the simplest of
all sampling routines and is therefore so labeled. Since it is random
sampling in its most uncomplicated form, it serves as a standard of sampling
efficiency against which other types are compared and evaluated. These
other types are made necessary by the fact that simple random sampling
in its pure form can almost never be employed in large-scale social
research. It is far too impractical and costly, and often even impossible.
Nevertheless, it is essential that the student clearly comprehend its basic
characteristics in order to be able to recognize and appreciate the degree
to which the alternative methods depart from this standard model.
Three of the most prevalent alternative types are here set forth as a brief



defect in the sampling procedure, which, upon later review and analysis, 
statisticians easily established. Although such a discrepancy could have 
occurred by chance, it would have been extremely unlikely. Hence the
deduction that the method of sampling was biased. But such appraisals 
are always retrospective. In advance of the sampling, one can only pro- 
vide for adequate machinery which will be reasonably certain to yield a 
random selection. 
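
How unlikely such a chance discrepancy would be can be gauged with the standard error of a proportion. The following rough sketch uses the Literary Digest figures quoted in this chapter (about 60 per cent in the election, 40 per cent in a sample of some 2,000,000) and assumes, for the sake of argument, that the poll had been a simple random sample.

```python
from math import sqrt

p_true = 0.60        # Roosevelt's approximate share of the popular vote
p_sample = 0.40      # his share in the Literary Digest sample
n = 2_000_000        # approximate sample size ("over 2,000,000")

# Standard error of a sample proportion under simple random sampling.
standard_error = sqrt(p_true * (1 - p_true) / n)

# Discrepancy expressed in standard-error units.
z = (p_sample - p_true) / standard_error

print(round(standard_error, 6), round(z, 1))
```

A random sample of that size would rarely miss the true percentage by more than a small fraction of one point, so a gap of 20 points, hundreds of standard errors wide, cannot reasonably be charged to chance.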

Simple Random Sampling. If called upon to devise a do-it-yourself
sampling technique, the inexperienced layman, like our hostess, would
probably procure as many slips as there are items in the population to
be sampled. Slips of paper are more easily shuffled than people. We
may surmise that he would then number these slips consecutively from
1 through N, corresponding to the numbered units of the universe. Next,
he would place the slips in a suitable receptacle and mix them until the
set was thoroughly scrambled. Finally, he would reach in and take out
as many cases as desired. If he wished a sample of 10 cases, he would
take out 10 different slips. Statisticians recognize this as the simplest
type of random sampling and have therefore dubbed it simple random
sampling.

They do not casually describe it as “reaching in and drawing out n
different items.” Rather they define it as that procedure in which every
distinct sample of n items has an equal probability of selection from a
finite population of N items. This definition expresses the long-run
consequence of “reaching in and taking out n different items.” For,
if that procedure were applied indefinitely to a given population
(restoring the entire sample after each trial), each different sample would
tend to reappear an equal number of times. In its procedural aspect,
simple random sampling is the apparatus that guarantees the fulfillment
of this criterion of equal probability. In its substantive aspect, it is
the very criterion itself.

To unfold further the meaning of simple random sampling, let us
consider the number of ways in which samples of 2 items can be selected
from a miniature population of 5, whose members we shall designate:
a, b, c, d, e. By manipulation, we discover that there are 10 different
possible combinations, or samples:

      ab    bc    cd    de
      ac    bd    ce
      ad    be
      ae

Hence, the probability of each combination in simple random sampling
must be 1 in 10; each sample is expected to occur on the average once
in every 10 trials.
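
The enumeration just given can be reproduced mechanically. The short sketch below lists every sample of 2 from the population a, b, c, d, e and confirms that the count agrees with the usual combinatorial formula; the variable names are illustrative.

```python
from itertools import combinations
from math import comb

population = ["a", "b", "c", "d", "e"]

# Every distinct sample of 2 items, order within the sample being ignored.
samples = ["".join(pair) for pair in combinations(population, 2)]

# Under simple random sampling each distinct sample is equally probable.
prob_each = 1 / len(samples)

print(samples)
print(len(samples), comb(5, 2), prob_each)
```

For larger populations the list quickly becomes too long to write out, but the count, and hence the probability of any given sample, is still obtainable from `comb(N, n)`.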



from each of the broad geographical regions of the United States, it being
plausibly assumed that public opinion on many issues varies from one
sector to another; the Midwest, for example, is internationally more
isolationist than the East. Similarly, subsamples are commonly drawn
from various age and economic levels, as these factors are also known
to exert an influence on the content and intensity of public opinion. Older
persons are generally more conservative than younger; political opinions
tend to parallel economic interests. On the other hand, national opinion
polls would never stratify the population by hair color, since there is
probably no genuine correlation between hair color and political opinion.
In general, stratification serves no purpose when the stratifying factor
is uncorrelated with the sampling trait being measured.

Comparison of Strata. While the principal justification of stratified
sampling is equal accuracy with a smaller sample, the comparative data
which are its natural by-product provide an additional inducement for
using it. Thus, the decision to stratify by occupation in a survey of
opinion on labor unions may be prompted as much by the wish to
compare the characteristics of the various occupational classes as by the need
to economize on sample size. The differences among occupational
categories (Table 11.1) on the question “Are you in favor of labor unions?”


Table 11.1   Attitude Toward Labor Unions by Occupation,
             Percentage Distribution, U.S.

Occupation        Favorable    Unfavorable    Total

Farmers              52%           48%         100%
Businessmen          66            24          100
White Collar         69            31          100
Professional         77            23          100
Skilled              75            25          100
Unskilled            71            29          100

Total                67%           33%         100%

Source: American Institute of Public Opinion, Mar. 1943.


may, in fact, be even more pertinent and revealing than the over-all
weighted average of 67 per cent, which necessarily conceals such
differences. Since strata are often treated individually, it has been suggested
that the term “domains of study” be applied to strata when they are
being analyzed in this segregated manner.




introduction to the subject: (1) stratified, (2) cluster, and (3) interval
sampling. These sampling devices themselves are largely an outgrowth of the
sheer practical problems that have arisen in the sample surveys of large
human aggregates, which accounts for their wide currency in social science.

Stratified Sampling. “Do the citizens of Brownville favor racial
integration of schools?” No competent surveyor of public opinion would
attempt to answer that question without canvassing both white and Negro
residents of the community. Public sentiment on this issue would not
be reliably portrayed by a sample that slighted either group, since their
opinions are so divergent. Yet such an imbalance between groups might
occur under simple random sampling, unless the sample were made
large enough to forestall that eventuality. A more economical alternative
would be to sample each subgroup separately and combine the results,
and thereby avoid a costly inflation of sample size. Such an operation,
which first separates the entire population into relevant strata before
randomly drawing the sample, is known as stratified sampling.

From a procedural standpoint, stratified sampling therefore consists
of the following stages: (1) division of the total universe into subclasses,
or strata; (2) the selection of a random sample from each stratum; and
(3) the consolidation of the subsample statistics into a combined statistic
weighted for size of strata. In this context, the term stratification does
not, of course, connote a hierarchy, as the ranks of an army, or the
geological seams of the earth crust; rather it signifies the categories of a
statistical variable, such as race, sex, or religion, into which the total
population is conveniently divided.
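
The three stages can be traced in a short sketch. Everything in it is hypothetical: the two strata, their sizes, the coded opinions (1 = favorable, 0 = unfavorable), and the function name are invented solely to illustrate the weighting of subsample results by size of stratum.

```python
import random
from statistics import mean

def stratified_estimate(strata, n_per_stratum, seed=None):
    """Draw a random subsample from each stratum and combine the
    subsample means, weighting each by its stratum's share of the universe."""
    rng = random.Random(seed)
    total = sum(len(values) for values in strata.values())
    estimate = 0.0
    for name, values in strata.items():
        subsample = rng.sample(values, n_per_stratum[name])  # stage 2
        weight = len(values) / total                         # size of stratum
        estimate += weight * mean(subsample)                 # stage 3
    return estimate

# Stage 1: the universe divided into strata (hypothetical opinions).
strata = {
    "group_a": [1] * 300 + [0] * 100,   # 75 per cent favorable
    "group_b": [1] * 20 + [0] * 80,     # 20 per cent favorable
}
estimate = stratified_estimate(strata, {"group_a": 40, "group_b": 10}, seed=1)
print(round(estimate, 3))
```

In this invented universe the true proportion favorable is 0.64, and the weighted estimate varies around that figure from one drawing to another; the weights ensure that neither group is slighted, however small the subsample drawn from it.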

Nor does stratification imply a relaxation of the requirement of
randomized selection, although that inference has been sometimes mistakenly
drawn. This misconception may possibly reflect a failure to distinguish
clearly between stratified random sampling and so-called quota sampling,
which was formerly widely used in opinion polling. In quota sampling,
quotas are pre-assigned to strata, but the final selection of cases is left
to the discretion of the interviewer. However, if the benefits of random
sampling are to be attained, subsamples from strata must be randomly
chosen. The resort to convenience or judgment sampling is no more
warranted within a given stratum than it is in the whole, unstratified
population, and is almost certain to lead to biased results.

Since stratified sampling is more complex than simple random
sampling, we may rightly ask what are its compensating advantages. Briefly
put, it is a labor-saving device for securing equivalent accuracy with
fewer cases than is likely under simple random sampling. It is
essentially for this reason that national public opinion polls universally resort
to some form of stratified sampling in order to keep the size of the sample
down to manageable proportions. Thus, subsamples are usually selected


However, the execution of any social survey is not fulfilled by the mere
drawing of the sampling units. We still must make the personal contact
with the selected elementary units to obtain the information which was
the objective of the study in the first place. This problem of personal
contact develops into a major obstacle in simple random sampling, which
may yield a sample whose elements are widely dispersed over a wide area,
requiring a prohibitive expenditure of time and energy to reach. It is
in such a plan for personal interviews that cluster sampling displays
another technical advantage. Under this procedure, the elementary units
are territorially concentrated, and are therefore more easily accessible,
with less wastage in transportation. This is shown in Figure 11.1.



Figure 11.1   Schematic Diagram, Simple Random and Cluster Sampling
              (panels: Simple Random Sample; Cluster Sample)

But the physical and mechanical convenience of cluster sampling is
purchased at a price in quality. It generally lacks, to a degree, the very
characteristic which is the objective of good sampling: typicality and
representativeness. Its reduced representativeness is due to the fact
that the elementary sampling units within clusters, particularly human
clusters, are likely to be closely similar in regard to their social
characteristics; consequently, sampling within these clusters understates the
dispersion and provides unnecessary duplication. For example, the
residents of a given city block are likely to belong to the same socio-economic
class, and therefore hold the same political and social opinions. By the
same token, they are not likely to be representative of any diversity of
public opinion in the large aggregate of unsampled neighborhoods; a
cluster sample is thus less likely to represent the variety of opinions than
a simple random sample of the same size scattered over the community.
It is this lack of heterogeneity within clusters which reduces the
comparative effectiveness of cluster samples.

This liability notwithstanding, cluster sampling is being increasingly
employed in social research. For, after all, statistical work, like every
other human endeavor, is a compromise between the ideal and the
practical, and in any case can attain only an approximation of “truth.”



Cluster Sampling. In the case of the drawing for the door prize, it was
obviously necessary to have a complete list of eligible names, from which
the winning one was randomly drawn. For, “equal opportunity to be
included” implies not only a mechanically reliable selection procedure,
but also a complete list, so that every name is accessible to the draw.
It would similarly be necessary to have a complete source list of sampling
units if a simple random sample were drawn from any other statistical
universe, such as the inhabitants of a community, an army, or a
university student body. But such social universes differ from the more or
less trivially small, easily manipulated guest list of a banquet. When
the universe is vast, and extends over a wide area, the compilation of
such an indispensable list is a laborious undertaking; and even when
available (e.g., a city directory), it is usually not up-to-date on the date
of issue because of the mutability of the population.

If, therefore, an adequate list of elementary sampling units is not
available, we may turn instead to more or less permanent groupings
into which the population is naturally divided and which can be
conveniently listed. Human beings are, of course, usually found in prevailingly
standard groups; they are clustered geographically by states, counties,
municipalities, neighborhoods, blocks, precincts, and dwelling units;
people work together in factories, offices, and stores; they are organized
in innumerable clubs, lodges, schools, and miscellaneous associations.
It is these groups, or clusters, which may be serially utilized as sampling
units, through which we reach the ultimate elementary unit (e.g., the
person or household) which is the objective of our survey. Such a
procedure is therefore termed cluster sampling. We thus finally reach the
elementary unit through a shorter or longer chain of samplings of the more
easily listed clusters. Since sampling is carried out in successive stages
before reaching our destination, this type of sampling is also called
multi-stage sampling. When the clusters at any stage consist of territorial
units, we may describe that stage as area sampling.

Let us suppose that we wish to survey the occupational ambitions of
high school pupils, ages 15-17, in Chicago. It would be exceedingly
laborious and financially prohibitive, as well as hazardous in accuracy,
to compile a list of all specified pupils in that metropolis. However, a
permanent list of high schools is easy to obtain. These schools could be
randomly sampled, and a list of students in the desired age categories
would then be obtained from the comparatively few high schools in the
sample. From that list, the sample of students would be finally drawn.
Similarly, if the households of the city were to be surveyed, a sample of
city blocks, then of dwelling units, and finally of households would be
drawn. Even though such a multi-stage procedure would still require a
source list at each level, the lists would be smaller and more current. In
such economy lies the first advantage of the cluster approach.
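
The two-stage routine described for the high school survey may be sketched as follows. The school rosters, their sizes, and the function name are hypothetical; the point is only that a roster is needed for the sampled schools and for no others.

```python
import random

def two_stage_sample(clusters, n_clusters, n_per_cluster, seed=None):
    """Stage 1: draw clusters at random from the list of clusters.
    Stage 2: draw elementary units at random within each chosen cluster."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)
    sample = []
    for cluster in chosen:
        roster = clusters[cluster]            # listed only for sampled clusters
        k = min(n_per_cluster, len(roster))
        sample.extend((cluster, unit) for unit in rng.sample(roster, k))
    return sample

# Hypothetical universe: 20 schools with 50 eligible pupils each.
schools = {f"school_{i}": [f"pupil_{i}_{j}" for j in range(50)]
           for i in range(20)}

sample = two_stage_sample(schools, n_clusters=4, n_per_cluster=10, seed=7)
print(len(sample), sorted({school for school, _ in sample}))
```

With clusters of equal size, as assumed here, every pupil has the same over-all chance of selection (4 in 20 times 10 in 50). When clusters differ in size, that equality no longer holds automatically, and the design must allow for it.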


presumed to be randomly ordered, simple random and interval sampling
will yield identically accurate results. Such a presumption is often
reasonable when items have been alphabetically listed, since there is
usually no correlation between the alphabetical order of names and the
traits which the named objects possess. Thus, we have previously seen
that the alphabetical listing of large American cities orders the respective
suicide rates in a sequence which is seemingly purely random.
Similarly, an alphabetical listing of students may be expected to result in a
sequence of grade averages that is wholly random and, therefore, free of
trends and cycles. In such cases, it makes no ultimate difference whether
we select random digits or draw every kth unit, except that interval
sampling is usually more simple to execute. The long-run results would
be virtually identical for any given size sample.

There is, however, one notable circumstance that constitutes a special
hazard for interval sampling and may easily lead to erroneous conclusions.
When the universe values form a cyclical progression, the sampling
interval may coincide with the phase of the cycle, causing interval
sampling to yield an unrepresentative set of identical values. Let us consider
a fictitious sequence whose phase is four: 1, 2, 3, 2; 1, 2, 3, 2; 1, 2, 3, 2.
Now, if the sampling interval is set equal to 4, any sample will necessarily
consist of a set of identical values. It will consist of all 1's, all 2's, or all
3's. Such samples will in no way do justice to the variation in the universe.
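
The hazard can be verified directly on the fictitious sequence. The short Python sketch below is illustrative only; the helper name `interval_sample` is ours, and positions are counted from zero.

```python
def interval_sample(values, k, start):
    """Return every k-th value, beginning at 0-based position `start`."""
    return values[start::k]

cycle = [1, 2, 3, 2] * 3  # the fictitious sequence, phase of four

# With the interval equal to the phase, every possible sample is constant.
same_phase = [interval_sample(cycle, 4, start) for start in range(4)]
print(same_phase)  # [[1, 1, 1], [2, 2, 2], [3, 3, 3], [2, 2, 2]]

# An interval of 3, which is out of step with the cycle, recovers the variation.
print(interval_sample(cycle, 3, 0))  # [1, 2, 3, 2]
```

In this example an interval that is not a multiple of the phase restores the variation; whether that remedy is available in practice depends on knowing the length of the cycle beforehand.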

This type of pitfall is illustrated in a sampling study of June issues of
the Sunday New York Times, 1932-1942, which disclosed that only
Protestant marriages were featured on the society page of the sampled
issues.* From this finding, the conclusion was drawn that the upper-upper
social class of New York City was preponderantly Protestant in
religious background. But this inference was immediately challenged on
the ground that Jewish marriages happen for ceremonial reasons not to
be performed in June, and therefore had no opportunity to appear in
the issues of the Times which were sampled. A check sampling,
undisturbed by daily and monthly cycles, revealed that Jewish marriages
were in fact regularly featured by the society editors during the
appropriate seasons. By that criterion, Jews were proportionately represented
in the upper social strata. In this instance, the sample interval led to an
overrepresentation of Protestants, an error compounded by the
unfortunate judgmental selection of June as a point of origin.

But interval sampling also carries its intrinsic advantages. In fact,
when the numerical values form an arithmetic progression, interval
sampling will be even more effective than simple random sampling. For, in

* David and Mary Hatch, "Criteria of Social Status as Derived from Marriage
Announcements in the New York Times," American Sociological Review, XII, 1947,
pp. 396-403; and W. J. Cahnman, "A Note on Marriage Announcements in the New York
Times," American Sociological Review, XIII, 1948, pp. 96-97.




Interval (Systematic) Sampling. Whenever the sampling units are
arranged in some kind of natural sequence, like consecutive admissions to
a hospital or library books in the card catalogue, it may be economical
and even preferable to obtain a sample by taking cases at a fixed interval.
Such a procedure is termed interval sampling, or more commonly but
less aptly, systematic sampling. The selection of every tenth name from
the telephone directory, after a random start among the first ten names,
illustrates the process of interval sampling. Such a procedure has obvious
utility for the social scientist who frequently has occasion to study a
series of events such as a file of newspapers, the characteristics of
dwelling units in a given ecological area, the cases on docket in a criminal
court, or a card catalogue of welfare case records.

To establish the width of the sampling interval (k) in any given
problem, we merely find the ratio of population size (N) to desired sample
size (n):

$$k = \frac{N}{n}$$

Thus, if the sample is to contain 5 per cent of the universe, or 1 out of
every 20 cases, the sampling interval obviously would be 20; and we
would draw every 20th item, randomly starting with any number within
the first interval of 20. Such calculation presupposes, of course, that
sample size has been fixed in advance of the sampling. But if n has not
been set, and an arbitrary interval is employed, it is still necessary to
pass through the entire sequence, even though we may seem to have an
ample number of items after we have proceeded only part of the way.
If we discontinued the drawings before completing the entire circuit, we
would deprive the units in the omitted segment of the opportunity of
being chosen and thereby destroy the randomness of the operation.
For example, by skipping names L to Z in an alphabetical listing, we
would almost certainly produce a biased sample.
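
As a minimal sketch of the rule, the following Python function computes the interval as the ratio of N to n, picks a random start within the first interval, and then passes through the entire sequence. The function name and the list of 1,000 numbered units are hypothetical, and the sketch assumes that n divides N evenly.

```python
import random

def systematic_sample(units, n, seed=None):
    """Interval sample of size n: every k-th unit after a random start."""
    N = len(units)
    k = N // n  # sampling interval; assumes n divides N without remainder
    start = random.Random(seed).randrange(k)  # 0-based start in first interval
    # Slicing to the end of the list passes through the whole sequence,
    # so no segment is deprived of its chance of selection.
    return units[start::k]

cases = list(range(1, 1001))  # hypothetical file of 1,000 numbered cases
drawn = systematic_sample(cases, n=50, seed=7)
print(len(drawn), drawn[1] - drawn[0])  # 50 20
```

A 5 per cent sample of 1,000 cases thus gives an interval of 20 and a sample of 50, whatever the random start.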

Since each interval contributes one and only one item to each sample,
it follows that there can be no more different samples than there are
items within the interval. Thus, if the sampling interval is equal to 10,
there can be only 10 possible samples, regardless of the size of the
universe, be it 1,000 or 1,000,000. But the number of different samples
would be almost incalculable in simple unrestricted random sampling.
This restriction in the number of possible samples serves to distinguish
interval sampling from simple random sampling, since the latter furnishes
an equal opportunity of selection to every distinct combinatorial sample
of n items.
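
The contrast in the number of possible samples is easily computed. The Python sketch below is illustrative; the function names are ours, and the universe sizes are chosen only for the example.

```python
import math

def count_interval_samples(k):
    """One possible sample for each random start within the interval."""
    return k

def count_simple_random_samples(N, n):
    """Number of distinct combinations of n items out of N."""
    return math.comb(N, n)

# Small hypothetical universe: N = 20, n = 2, so the interval is k = 10.
print(count_interval_samples(10))          # 10
print(count_simple_random_samples(20, 2))  # 190

# With N = 1,000 and n = 100 (k = 10) there are still only 10 interval
# samples, while the count of combinations runs to well over 100 digits.
big = count_simple_random_samples(1000, 100)
print(len(str(big)) > 100)  # True
```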

In spite of this severe limitation, interval sampling will often produce
results that compare favorably in representativeness to those yielded by
simple random sampling. In particular, whenever the values may be


Analogously, the effectiveness of cluster sampling will be enhanced by
expertly composing and recomposing clusters in advance of the sampling
so that each cluster is as representative of the entire population as possible.
Insofar as that effort can achieve success, given the heterogeneity of the
grand universe itself, the respective clusters will tend to resemble one
another, while there will be considerable statistical variation within each
cluster. Thus, in cluster sampling we invert the specifications of stratified
sampling: instead of homogenizing strata we diversify the elements within
clusters. We may, for example, combine precincts into larger geographic
districts in order to increase the diversification within clusters and thereby
raise their representativeness of the entire electorate and fulfill their
function as samples.

In sum, no sampling technique is completely automatic; all involve
subject-matter decisions. Hence, first-hand practical experience with the
concrete subject matter contributes quite as much to fulfillment of a
sampling project as dexterity in the mechanical routines of applied
statistics.


Questions and Problems

1. Define the following concepts:

Universe

Sample

Infinite Population
Finite Population
Sampled Population
Target Population
Availability Sample
Haphazard Sample
Judgment Sample
Random Sample
Simple Random Sample
Stratified Sample
Cluster Sample
Interval Sample

2. State difficulties that might be encountered in defining the populations:

Families 
Dwelling Units 
Farms 
Households 
City Blocks 
Broken Homes 
Overcrowded Homes 
University Students 
Gainfully Employed 
Social Class 




that event, the sample will necessarily distribute itself evenly over the
entire range of values and thereby provide a reliable miniature of the
population distribution. If, for example, we select every fifth boy from
a lineup according to height, the resulting sample will necessarily be
representative of the distribution of boys' heights. Analogously, if we sample
every 25th dwelling unit along a metropolitan avenue after a random start,
we would probably obtain an accurate cross-section of the various
socio-economic groupings on that street since dwelling units are segregated and
ordered according to social status. In this way interval sampling may
supply its own stratification.
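
A small experiment illustrates the point. The Python sketch below uses a hypothetical universe of the values 1 to 100, which form an arithmetic progression, and compares the scatter of the ten possible interval-sample means with the scatter of 2,000 simple random sample means of the same size. The universe, the sample size, and the seed are assumptions made only for the illustration.

```python
import random
import statistics

population = list(range(1, 101))  # hypothetical arithmetic progression
true_mean = statistics.mean(population)  # 50.5
k, n = 10, 10

# All ten possible interval samples, one for each start within the interval.
interval_means = [statistics.mean(population[start::k]) for start in range(k)]

# Simple random samples of the same size, drawn without replacement.
rng = random.Random(0)
srs_means = [statistics.mean(rng.sample(population, n)) for _ in range(2000)]

interval_spread = statistics.pstdev(interval_means)
srs_spread = statistics.pstdev(srs_means)
worst_interval_error = max(abs(m - true_mean) for m in interval_means)
print(interval_spread < srs_spread, worst_interval_error)  # True 4.5
```

No interval sample can miss the true mean by more than 4.5 in this universe, because every sample is forced to spread itself over the whole range of values.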

From this, it is evident that the principal advantage of interval sampling
lies in the mechanical ease with which it can be applied to such natural
sequences as rows of dwelling units, card files, city directories, and so on.
Its special hazard is the cyclical sequence, and we must maneuver to
circumvent it when that danger is thought to exist.

Interrelatedness of Random Sampling Procedures. Quite obviously, the
foregoing random procedures are not mutually exclusive. They may be,
and usually are, combined in a variety of ways. Cluster sampling
may be used within broad strata, and interval sampling may be used
within these clusters. No single sample design is best for any given
purpose. In all sampling, an attempt is made to attain the desired degree
of representativeness as economically as possible, which is the guiding
criterion of modern sampling design. We should recognize that the
designing of a sample is a form of statistical engineering and accounting,
requiring appropriate skills and knowledge. In the foregoing statement,
we have merely hinted at the technical aspects of sampling, which are of
course fully developed in treatises on that subject.

However, effective sampling requires much more than mere technique.
If the assets of a given sampling procedure are to be fully realized, it is
essential that all necessary discretionary as well as mechanical steps be
expertly performed. Thus, the anticipated benefits of stratified sampling
will not be attained unless the strata into which the population was
judgmentally divided before the sample was drawn actually differed among
themselves on the sampled trait. In the aforementioned instance,
stratification by race would be profitless if Negroes and whites shared the same
opinions on school integration. If on the other hand, Negroes and whites
differed widely, then, statistically speaking, there would have been
considerable variation between strata, but relatively little within each stratum,
thereby validating the original decision to stratify by race. It is more
efficient in terms of sample size to sample from two homogeneous strata
than from one very mixed, heterogeneous stratum. The knowledgeable
worker must anticipate the validity of the stratifying criterion before the
sampling begins.



mixture of nationalities: a Polish concentration (5 per cent of the population)
in letters X, Y, Z, and an Italian (5 per cent) concentration in P, R, S. What
sampling procedure would you use?

18. (a) "The admissions to the hospital over a period of a year constitute a
random order." Comment.

(b) Comment similarly on the daily list of dispensary patients.

(c) Comment on a list of prison admissions.

19. Suggest a procedure to determine whether there is a correlation between
national origins and initial letter of the name.

20. Distinguish clearly between the concepts "cluster" and "stratum" in method
of selection. What are the desirable characteristics of a cluster? a stratum?

21. From a class of 25 pupils, how many distinct triads (groups of three) can be
formed?

22. (a) If a person were dealt a bridge hand (sample) of 13 black cards, would
that prove the deck (population) consisted of all black cards?

(b) Does it prove the drawing (deal) was biased rather than random?

23. Consider a universe of 12 elements: A to L.

(a) How many simple random samples of size 4 can be formed?

(b) How many interval samples of size 4?

(c) How many stratified samples of size 4, when Stratum 1 consists of elements
A-F, and Stratum 2 consists of elements G-L, and 2 elements are to be
randomly selected in simple manner from each stratum?

24. (a) Consult the table of random digits in the Appendix (Table n*). Write a
directive for selecting by random digits a simple random sample of 100 (n)
items from a universe of 13,300 (N).

(b) In this instance, what is the sampling ratio or fraction?

(c) How many more cases would have to be added if the sampling ratio were
fixed at 1^7 


Selected References

Ackoff, Russell L., The Design of Social Research. The University of Chicago
Press, Chicago, 1953. Chapter 4.

Cochran, William G., Sampling Techniques. John Wiley & Sons, Inc.,
New York, 1953.

Cochran, William G., Frederick Mosteller, and John W. Tukey, Statistical
Problems of the Kinsey Report on Sexual Behavior in the Human
Male. The American Statistical Association, Washington, D.C.,
1954.

Havemann, Ernest, and Patricia Salter West, They Went to College: The
College Graduate in America Today. Harcourt, Brace and Company,
New York, 1952. Appendix.

Kish, Leslie, "Selection of the Sample," in Research Methods in the
Behavioral Sciences, edited by Leon Festinger and Daniel Katz. The
Dryden Press, New York, 1953. Chapter 5.




3. Which universe could be more precisely defined: a carload of wheat or a group
of all taxpayers? a shipment of electric light bulbs or all gainful workers?

4. Compare the problems of sampling a shipment of wheat with those of sampling
a human population.

5. If you were to select one sample peach to take home, would you select
randomly or judgmentally?

6. (a) Since random sampling gives the "deviant" case the same opportunity to
be selected as the "typical," what is its justification?

(b) How do unrepresentative cases contribute to the representativeness of the
sample?

(c) Would a sample, carefully and randomly drawn, necessarily be
representative?

7. If you had to select and interview a random sample of residents (adult) in the
community, what procedural difficulties would you almost certainly encounter?

8. In a community survey, what would be the disadvantages of selecting a sample
of families from only one neighborhood? Under what circumstances might
such sampling procedure be acceptable?

9. Explain: "Stratified sampling is possible only when there is some previous
information on the population."

10. How would you proceed to sample the following?

(a) The students on a campus for occupation of fathers
(b) Households in a city for number of gainful workers
(c) A concert audience for socio-economic membership

(d) A theatre audience for reaction to play

(e) Library reference room readers for length of stay

11. Distinguish stratified random sampling and quota sampling.

12. A sample of all persons 65 and over is to be drawn in a given community.
Suggest possible "standard" clusters that might be used to expedite such a
plan.

13. "An area cluster sample could be re-used for successive studies of a
metropolitan community over a period of one year." Comment.

14. State how you would check the assumption of the Time survey of college
graduates that "Fa" names are randomly distributed in socio-economic and
other similar groupings.

15. Along a given street a mile long, the socio-economic status of the residents
rises for j of a mile, and then drops sharply at the edge of the city. What
sampling procedure would you use?

16. List possible non-random steps in restricted random sampling.

17. There are 1,000 households on relief in a given agency; they are alphabetically
arranged in a card catalog. A social welfare administrator wishes a sample of
100 in order to determine the average amount of relief. The population is a




Statistical Inference 


Section One

Parameter Estimation 


Problems in Estimation. It should be unnecessary to repeat that a sample
is useful only for the information it supplies on the characteristics of the
universe. Thus, the sampling process finds its ultimate consummation in
a description of the population from which the sample is drawn. But this
description can be only an approximation: an average monthly expenditure
of $92.75 in a random sample of college students will not correspond exactly
to the average expenditure of the whole student body which the sample
is designed to represent. Nor, obviously, will a second sample of $58.62
necessarily correspond more closely to the unknown parameter. If,
therefore, owing to the vagaries of chance sampling, no two samples are alike
and all are in error, with how much confidence can we speak of the value
of the universe? Clearly, we cannot merely project the value of the sample
onto the universe, and let it go at that. This procedure of adopting the
sample value in lieu of the parameter, or universe value, is hedged about
with as many regulations as is the procedure of sampling itself. We call
this set of prescribed procedures statistical inference.

Since, in the practical affairs of life, it is so rare that we are able to
examine a whole universe, and since sampling is so frequently the resort, the
procedures of statistical inference loom quite large in the repertory of
statisticians. In fact, some would actually identify the science of statistics
with the problem of sampling and decision-making, an extreme emphasis
which seems to the authors unrealistic, since descriptive statistics have a
validity and importance in their own right.

The problems of statistical inference begin with the legitimate
assumption of a discrepancy between the variable sample estimate and the uni-





McCarthy, Philip J., Introduction to Statistical Reasoning. McGraw-Hill
Book Company, Inc., New York, 1957. Chapters 9 and 10.

Parten, Mildred B., Surveys, Polls and Samples. Harper & Brothers, New
York, 1950. Chapters 4 and 9.

Stephan, Frederick F., "History of the Uses of Modern Sampling
Procedures," Journal of the American Statistical Association, XLIII, 1948.
Pages 12-39.

Stephan, Frederick F., and Philip J. McCarthy, Sampling Opinions: An
Analysis of Survey Procedure. John Wiley & Sons, Inc., New York,
1958. Chapters 1-3.

Yates, Frank, Sampling Methods for Censuses and Surveys, 2d edition.
Charles Griffin, London, 1953.





§ 12.1 Parameter Estimation


Table 12.1.1a  Array of 100 Sample Means, n = 30


9^ 

11.6 

122 

12.7 

13.1 

10.0 

11.6 

122 

12.7 

132 

lOJ 

11.7 

122 

12.8 

132 

10.3 

11.7 

122 

19 K 

132 

la? 

11.S 

122 

19R 

132 

10.8 

llA 

12.4 

12.S 

132 

10.S 

l\£ 

12.4 

122 

132 

10.9 

Its 

12.4 

122 

132 

10.9 

ns 

12.4 

122 

13.4 

IIJ) 

12S 

122 

129 

132 

11J3 

12D 

122 

122 

132 

11.1 

12D 

122 

13X1 

132 

iia 

12.0 

12£ 

132 

13.6 

11.1 

12.0 

122 

132 

13.7 

11.3 

12A 

12X 

132 

132 

llA 

12.1 

12j6 

13.1 

132 

nji 

12.1 

12.6 

121 

142 

iii 

12.1 

12.6 

121 

11.0 

11.5 

122 

12.6 

13.1 

142 

11.6 

122 

12.6 

121 

142 


Table 12.1.1b  Frequency Tally, Empirical Sampling Distribution,
               100 Sample Means, n = 30

Class Interval   Tally                                                f
 9.0- 9.9        /                                                    1
10.0-10.9        ///// ///                                            8
11.0-11.9        ///// ///// ///// /////                             20
12.0-12.9        ///// ///// ///// ///// ///// ///// ///// ///// //  42
13.0-13.9        ///// ///// ///// ///// /////                       25
14.0-14.9        ////                                                 4


Inspecting this lot, we see that only 4 of the 100 samples coincide exactly
with the universe mean, known to be 12.4. All others contain a sampling
error; in fact, every one of them would probably have shown a sampling
error if we had insisted on greater decimal accuracy. We can see, however,


ber of samples is called an empirical sampling distribution. The theoretical
sampling distribution would, of course, be attained only by an infinite
number of samples, since it would require that many to assure the
distribution of sample means in their proper, expected proportions. The
significant procedural point, however, in this type of problem, is that we
can usually treat the theoretical sampling distribution of means as if it
were a perfect, smooth, ideal normal curve.

Distribution of Sampling Errors. By subtracting the universe mean from
each of the 100 sample means, we obtain the 100 sampling errors (Table
12.1.2a), which of course have the same curve pattern as the means
themselves (Table 12.1.2b). All we have done is move the zero origin to the
universe mean (12.4).
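
The operation can be imitated by a small sampling experiment. The Python sketch below does not use the suicide rates of the text; it builds a hypothetical universe of 107 values with a seeded random number generator, draws 100 samples of 30, and converts the sample means into sampling errors. All names and figures in the code are assumptions made for illustration.

```python
import math
import random
import statistics

rng = random.Random(12)

# Hypothetical universe of 107 rates (not the data of the text).
universe = [round(rng.uniform(3.0, 25.0), 1) for _ in range(107)]
mu = statistics.mean(universe)  # the universe mean, known in this experiment

# 100 simple random samples of n = 30, drawn without replacement.
means = [statistics.mean(rng.sample(universe, 30)) for _ in range(100)]

# Sampling error = sample mean minus universe mean.
errors = [m - mu for m in means]

# Shifting the origin leaves the scatter unchanged and centers it near zero.
same_scatter = math.isclose(
    statistics.pstdev(errors), statistics.pstdev(means), rel_tol=1e-9
)
print(same_scatter, abs(statistics.mean(errors)) < 0.5)  # True True
```

The errors have exactly the scatter of the means themselves; only the origin has been moved to the universe mean.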


Table 12.1.2a  Array of Sampling Errors, 100 Samples, n = 30


- 2.5 

-4 

-4 

+4 

+.7 

2.4 


4 

4 

4 

2.1 

.7 

4 

.4 

£ 

2.1 

.7 

4 

.4 

£ 

i.r 

.6 

.1 

.4 

4 

1.6 

.6 

0 

.4 

.9 

1.6 

.6 

0 

4 

.9 

1.5 

.6 

0 

4 

.9 

1.5 


0 

.5 

1.0 

1.4 

.4 

+.1 

4 

1.1 

1.4 

.4 

.1 

4 

l.l 

1.3 

.4 

.1 

.6 

1.1 

1.3 

.4 

4 

.6 

14 

14 

.4 

4 

.6 

14 

1.1 

4 

4 

.6 

1.4 

14 

4 

4 

.7 

14 

.9 

4 

4 

.7 

1.6 

.9 

4 

4 

.7 

1.6 

.9 

4 

4 

.7 

14 

S 

4 

4 

.7 

1.9 


The distribution of sampling errors (Table 12.1.2b) again demonstrates
that the small deviations are very numerous; the larger discrepancies are
few in number. If we had drawn a single sample only, we could be
practically certain that our sample mean would not have missed the true mean
by more than two points, plus or minus; indeed, 68 per cent of the samples
are in error by less than plus or minus one.




that the universe mean is snugly nested in the very middle of the whole
array of sample means, and that most of the sample means are compactly
huddled around the true mean of 12.4, which is practically at the 50 per
cent division point. Approximately two-thirds of the 100 means fall
between 11.4 and 13.3, or virtually within one point of the true mean.
Characteristically, the sample means gravitate toward the parent mean,
or the center of the universe distribution.

It now becomes evident how improbable it is that a sample mean will
deviate seriously from the universe mean. Furthermore, not only do
sample means cluster around the true mean, but their pattern of
distribution takes on a shape unmistakably approaching the normal curve (Figure
12.1.1). If we had drawn, processed, and tabulated all possible samples,



Figure 12.1.1  Histogram of 100 Sample Means, n = 30


instead of merely 100 of them, we would have a pattern of distribution
which would display still greater conformity to the smooth normal curve.
This hypothetical frequency curve of all possible sample means is labeled
the theoretical sampling distribution, and has, as its mathematical model,
the ideal normal curve. Any experimental distribution of a limited num-



Table 12.1.2b  Frequency Distribution of Sampling Errors


The presence of sampling errors obviously complicates the task of
drawing an inference about the parameter; but the neat pattern of these errors
renders the uncertainties of such inference less formidable. Our dissection
of the empirical sampling distribution has given us considerable
reassurance of the reasonableness of the behavior of samples, which will permit
us to estimate the degree of confidence of which the single sample is worthy.

Sample Reliability. We must now take up the technique which will make
appropriate allowance for the ever present sampling error, and thereby
enable us to make reliable estimates of the parameter. For, unless we can
rely with a certain degree of confidence on our inferences, there is no point
in making them at all. The entire purpose of sampling is to substitute a
sample of given reliability for the prohibitively complete enumeration of
the unknown universe.

But we cannot measure the sampling error of our single sample directly
since we are never given the true mean from which to compute it. That
would suppose that we already have what we have set out to find in the
first place. The single sample must ultimately therefore provide two
kinds of information, as can be deduced from the foregoing analysis:
(1) an estimate of the parameter, and (2) the degree of confidence we may
place in that estimate, or the reliability of the estimate (in our example,
the reliability of the mean).

The procedures for measuring the reliability of an estimate are numerous,
since they differ according to the sample statistic (for example, whether
mean or percentage) and according to the make-up of the sample (whether
large or small, simple or stratified). Here, we present only two of the
simplest techniques for measuring sample reliability; these apply to means
and percentages of large random samples, respectively. They will suffice
for our purposes, since other techniques produce results which in principle
carry the same interpretation as the method here unfolded.

Interval Estimate of the Mean. Let us suppose that, in taking a single
sample (n = 30) from the 107 suicide rates, we obtain a mean of 13.5. In
such a realistic situation, we do not know the value of the true mean, and


Class Interval       f
-3.4 - (-2.5)        1
-2.4 - (-1.5)
-1.4 - (- .5)
- .4 -    .5
  .6 -   1.5
 1.6 -   2.5
Total              100






Computation of the Standard Error. Both of the foregoing insights are in
accord with statistical theory, and are actually explicit in the formula for
the standard error of the mean, which contains n and σ as its basic terms.
Without further proof, we give the formula:

$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} \tag{12.1.1}$$

where $\sigma_{\bar{X}}$ = standard error of the mean
      $\sigma$ = standard deviation of the universe
      $n$ = sample size

Of course, we do not have the standard deviation of the universe, as
required by the formula; all we have is the standard deviation, s,* of the
sample. We will therefore substitute it for the unknown parameter in the
above formula:

$$\sigma_{\bar{X}} = \frac{s}{\sqrt{n}} \tag{12.1.2}$$



But the value of the substituted standard deviation of the sample is
not identical with that of the universe, owing to sampling error; in fact,
the former tends to be smaller than the latter because the scatter of values
within the sample tends to be smaller than that in the universe. To match
this understatement in the numerator (i.e., to maintain the ratio in the
fraction), we correspondingly reduce sample size (n) by one in the
denominator:

$$\sigma_{\bar{X}} = \frac{s}{\sqrt{n-1}} \tag{12.1.3}$$



This formula may fittingly be substituted for the population sigma. The
correction becomes trivial, of course, if sample size is very large.
Nevertheless, it is conventionally included, as a matter of principle, even in large
samples. The working formula, with familiar symbols, will therefore read
as follows:

$$s_{\bar{X}} = \frac{s}{\sqrt{n-1}}$$

This expression, then, yields the standard error of the mean which we 
have been looking for and from which we shall construct the interval 
estimate. 
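
The step from the standard error formula to the arithmetic of a worked example can be made explicit. The derivation below is a sketch under one assumption, namely that the sample standard deviation s is computed with the divisor n; this agrees with the remark that s tends to understate the universe value, but it is our reading rather than a statement of the text. The symbol x, standing for the deviation of a value from the sample mean, is introduced here for illustration.

```latex
s = \sqrt{\frac{\sum x^{2}}{n}}, \qquad x = X - \bar{X}

s_{\bar{X}} = \frac{s}{\sqrt{n-1}}
            = \sqrt{\frac{\sum x^{2}}{n}} \cdot \frac{1}{\sqrt{n-1}}
            = \sqrt{\frac{\sum x^{2}}{n(n-1)}}
```

Under that assumption the standard error may be computed directly from the sum of squared deviations, without first finding s.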


* Greek letters are conventionally used to represent parameters (e.g., σ, μ); Latin
letters (e.g., s, X̄) represent sample statistics, which are necessarily estimates. But
this practice is not uniformly adhered to by writers in the field.




is even three sigmas removed. But, if we had the value of the standard
deviation of the sampling distribution, we could set up an interval estimate
for the true mean; hence, we must obtain an estimate of this value. We
label this value, not a standard deviation (SD), but more descriptively a
standard error (SE), since the deviations of all sample means from the true
mean are actually sampling errors.

Our sample alone must furnish the information on which to base an
estimate of the standard error, since it is all we have. Hence, we must
explore the sample to uncover some cues to the magnitude of the
sought-for standard error. Initially, we might guess that the size of the sample
(n) and the degree of scatter in the sample (SD) are related to the size of
the standard error, according to the following reasoning.

First, as to sample size, we should expect larger samples to be more
representative of the universe than smaller samples, and we would
therefore expect larger samples to yield smaller sampling errors. In fact, when
a sample contains every last item in a finite universe (a 100 per cent sample),
there can be no sampling error at all. Every "sample" mean would then
necessarily be identical with the universe mean, and the standard error
would be zero. At the other extreme, the smallest possible sample would
contain only one case (n = 1). The distribution of sample means would
then be identical with that of individual universe values, and would
display the same amount of variation as the items in the universe itself.
Hence, in that instance, the standard error would be equal to the standard
deviation of the universe (σ). From this brief analysis, we may conclude
that the standard error of the mean (a) will always be smaller than the
standard deviation of the universe, since n will always be larger than 1;
and (b) will decrease as sample size (n) increases, reaching zero when
n = 100 per cent of the universe.

Second, we intuitively expect the degree of scatter in the universe, as
reflected by the degree of scatter in the sample, to affect the size of the
sampling errors. Thus, if all values in the universe were alike, all sample
means would also be alike, and there would be no sampling error whatever.
The standard error of such a completely homogeneous universe would
always be zero! A sample of one boy would give us without sampling
error the average number of arms for all boys, since every boy has the
same number of arms (a perfectly homogeneous universe); however, a
sample of one boy would not give us without error the average height of
all boys. There is a great variety of heights, which produces comparable
variety within samples, and consequently among the sample means as
well. Inverting this argument, we conjecture that a high degree of scatter
in the sample is indicative of heterogeneity in the universe and therefore
of large sampling errors. We thus anticipate that the standard error for
a given n will be larger when sampling a very heterogeneous mass of data
than when sampling a relatively homogeneous

and from any randomly selected mean we become virtually certain that
the resultant interval will trap, as it were, the target mean. Even the most
pessimistic soul, who always fears the worst, may give himself a sense
of relative security by attaching three standard errors to his observed
sample mean.

This method, be it noted, does not disclose the exact whereabouts of
the true mean, as has been previously cautioned; it merely furnishes a
stronger or weaker expectation that the true mean will be found within
the specified interval estimate, derived from the observed sample. The
true mean is stationary, wherever it is. Since it is the interval estimate
that either succeeds or fails in enclosing the parameter on a given sampling
trial, we may properly speak of the probability of such an interval
succeeding in encompassing the universe mean. Of course, once the sample has
been drawn and the interval formed (once the trial is over), it either
does or does not include the target mean between its calculated limits,
although we will never know. Our confidence rating in that interval
corresponds to its pre-trial probability, which has been selected for our
convenience and purpose.
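
The pre-trial probability can be watched in operation by repeating the trial many times on a universe whose mean is known. The Python sketch below is an illustration only: the universe of 10,000 values is hypothetical, the standard error is estimated as the sample standard deviation (divisor n) over the square root of n - 1, and 1.96 standard errors are attached to each sample mean.

```python
import math
import random
import statistics

def confidence_interval(sample, z=1.96):
    """Interval estimate: sample mean plus and minus z standard errors."""
    n = len(sample)
    mean = statistics.mean(sample)
    s = statistics.pstdev(sample)  # sample standard deviation, divisor n
    se = s / math.sqrt(n - 1)      # estimated standard error of the mean
    return mean - z * se, mean + z * se

rng = random.Random(42)
universe = [rng.uniform(3.0, 25.0) for _ in range(10_000)]  # hypothetical
mu = statistics.mean(universe)  # stationary target, known only to the experimenter

trials = 2000
hits = 0
for _ in range(trials):
    low, high = confidence_interval(rng.sample(universe, 30))
    hits += low <= mu <= high  # did this interval enclose the true mean?
coverage = hits / trials
print(0.90 < coverage < 0.98)  # True
```

Each single interval either succeeds or fails; it is the long-run share of successes, close to 95 per cent, to which the confidence rating refers.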


Working Procedure. To illustrate the process of calculating a confidence
interval, we carry out all necessary operations on the following sample
of 30 suicide rates:


6.0 

24.e 

13.4 


23.4 11.6 

8.4 10.0 

(1) We compute the sample o 


4.6 

14.7 

8.2 

26.1 

148 

10.4 


118 

II.6 

2S8 


(2) We compute the standard error of the sample mean:

$$s_{\bar{X}} = \sqrt{\frac{1{,}211.2}{(30)(29)}} = 1.2$$


18.8 

22.4 

10.0 

8.1 

6.0 

158 


(3) We multiply $s_{\bar{X}}$ by the number of sigma units to be added and
subtracted for the agreed-upon confidence interval, for example, 1.96
for the 95 per cent interval:

$$1.96\,s_{\bar{X}} = 1.96(1.2) = 2.4$$





Statistical Inference


Construction of the Confidence Interval (Large Sample). Since the sampling
distribution of the mean is approximately normal around the population
mean, it follows that two out of three sample means (68.27 per cent) will
lie within one standard error of the true mean. Our particular sample
mean will therefore have a 2 in 3 chance of falling within one standard
error of the true mean. Hence, by adding and subtracting one standard
error to and from our randomly selected sample mean, we have an
approximately 2 in 3 chance of enclosing the true mean. Similarly, 95 out
of 100 sample means lie within 1.96 standard errors of the true mean; in
consequence, if we attach 1.96 standard errors to either side of the sample
mean, we have a 95 per cent probability of enclosing the true mean. In
general, it is possible to provide any desired confidence interval by the
simple technique of attaching the requisite multiple of the standard error
to the observed sample mean. By thus making our interval estimate wide
enough, we may be practically certain that the true mean has been
contained within the interval, even though the observed sample mean is
highly inaccurate.
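
The correspondence between the multiple of the standard error and the degree of confidence can be checked numerically. The following is a minimal sketch, assuming an exactly normal sampling distribution; the function name `normal_coverage` is ours and does not come from the text.

```python
import math


def normal_coverage(z: float) -> float:
    """Proportion of a normal distribution lying within +/- z standard errors of its mean."""
    return math.erf(z / math.sqrt(2.0))


if __name__ == "__main__":
    # Multiples discussed in the text: 1, 1.96, and 3 standard errors.
    for z in (1.0, 1.96, 3.0):
        print(f"+/- {z:.2f} standard errors: {100 * normal_coverage(z):.2f} per cent")
```

For three standard errors the computed figure is about 99.73 per cent, which agrees with the 99.74 quoted in the text to within the rounding of printed tables.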

Let us suppose that we have drawn a “bad” sample, whose mean, X̄,
is located in the tail of the sampling distribution, far removed from the
universe mean (μ) which it intends to represent (Figure 12.1.2). Clearly,



Figure 12.1.2  Theoretical Sampling Distribution of Mean, Observed Sample Mean, X̄

in this instance, it would not have been sufficient to attach only one
standard error to either side of the observed mean in order to catch the
true mean; nor would it have been enough even to attach two standard
errors, since the true mean is more than two standard errors distant from
our illustrative sample mean. However, if we affix three standard errors
to either side of the observed mean, we obtain an interval whose lower
limit extends substantially beyond the true mean. Since practically all
(99.74 per cent) of the sample means lie within three standard errors of
the universe mean, by adding and subtracting three standard errors to






(4) We add and subtract $1.96\,s_{\bar{X}}$ to and from the observed sample
mean, $\bar{X}$. Thus:

    $\bar{X} \pm 1.96\,s_{\bar{X}}$
    13.1 + 2.3 = 15.4
    13.1 - 2.3 = 10.8

(5) We finally make the interpretation that the chances are 95 in 100
that the true mean will be found within the interval 10.8-15.4. We are
reasonably sure that our interval contains the true mean, since only 5
intervals in 100 constructed in the same manner will fail to do so. If,
however, it does not, something has happened which would occur only 5
times in 100, a risk which we may be willing to face.
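
The working procedure can be sketched in a few lines of code. This is an illustration only, starting from the summary figures quoted in the text (mean 13.1, sum of squared deviations 1,211.2, n = 30) rather than from the raw rates; the function name `mean_confidence_interval` is an assumption of ours.

```python
import math


def mean_confidence_interval(mean, sum_sq_dev, n, z=1.96):
    """Return (standard_error, lower, upper) for a large-sample interval estimate.

    sum_sq_dev is the sum of squared deviations of the observations
    from the sample mean.
    """
    standard_error = math.sqrt(sum_sq_dev / (n * (n - 1)))
    half_width = z * standard_error
    return standard_error, mean - half_width, mean + half_width


if __name__ == "__main__":
    se, lower, upper = mean_confidence_interval(13.1, 1211.2, 30)
    print(f"standard error = {se:.2f}")
    print(f"95 per cent interval: {lower:.1f} to {upper:.1f}")
```

Passing `z=1.0` or `z=3.0` instead yields the narrower 68 per cent interval or the wider interval of three standard errors.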

But how are we to decide on the width of the confidence interval? Here one
can offer only the most general guidance. Into the choice will enter a
consideration of the consequences of the decision, the risk one will wish
to run, and even the temperament of the person concerned. In effect, a
decision like this is comparable to any other of the numerous decisions we
make in the face of life’s uncertainties.

Demonstration of Experimental Confidence Intervals. All statements of
probability, it will be recalled, are based on the assumption of an infinite
number of trials. While it is not possible to conduct such an infinity of
trials, it is usually possible to conduct a fairly large number of experimental
trials, the outcomes of which then serve to test the initial probability
statement. To that end, we have calculated the 95 per cent confidence
limits for each of our 100 samples (n = 30) of suicide rates, in order to
determine whether the resulting intervals would actually enclose the
true mean approximately 95 per cent of the time. The diagram on the
opposite page (Figure 12.1.3) portrays the locations of the 100 confidence
intervals, along with the known population mean of 12.4. It will be
observed that exactly 95 out of 100 interval estimates do enclose the true
mean, a result which is, surprisingly enough, identical with theoretical
expectation. Here is tangible evidence that the confidence we place in a
properly constructed interval estimate is amply justified.
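
An experiment of this kind is easily imitated by simulation. The sketch below does not use the suicide-rate data; it assumes, purely for illustration, a normal universe with the stated mean of 12.4 and a hypothetical standard deviation of 6.5, and counts how often the interval of 1.96 standard errors encloses the true mean.

```python
import math
import random
import statistics


def interval_coverage(trials, n, mu, sigma, z=1.96, seed=0):
    """Fraction of simulated intervals (sample mean +/- z standard errors) enclosing mu."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Assumption: the universe is normal with mean mu and standard deviation sigma.
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        mean = statistics.mean(sample)
        # stdev() divides by n - 1, so this equals sqrt(sum of squares / (n(n - 1))).
        standard_error = statistics.stdev(sample) / math.sqrt(n)
        if abs(mean - mu) <= z * standard_error:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    share = interval_coverage(trials=100, n=30, mu=12.4, sigma=6.5)
    print(f"{100 * share:.0f} of 100 intervals enclose the true mean")
```

With only 100 trials the count will wander a few points on either side of 95; a larger number of trials settles closer to the theoretical figure.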

Convenience of Large Samples. Up to now, all probability statements
have been based on the assumption that the sampling distribution of the
mean is normal. It is reassuring that such a sampling distribution will
always be normal when the universe itself is normal. A normal
population produces normality in this sampling distribution, whatever the
size of the sample.

However, it is particularly characteristic of sociological data that
populations are often not normal. Size of families, income, and other social
variables distinguish themselves from typical psychological data in that






Figure 12.1.4  Histogram, 106 American Cities by Size of Population


Table 12.1.4a  Distribution of 200 Sample Means, n = 20

Mean ('000)        f     Per Cent

100-199            2       1.0%
200-299           50      25.0
300-399           67      33.5
400-499           31      15.5
500-599           18       9.0
600-699           17       8.5
700-799            7       3.5
800-899            4       2.0
900-999            1        .5
1,000 or more      3       1.5

                 200     100.0%








they are often severely skewed. We may therefore raise the question of
whether such skewness disturbs the required normality of the sampling
distribution. The answer is: it does. But at the same time, the effect
of such skewness can be circumvented by appropriately increasing the
size of the sample beyond the approximate minimum of 30. As sample
size increases, the sampling distribution approaches normality. However,
no general statement can be made about the rate at which the sampling
distribution approaches normality, since this rate will vary according to
the severity of the skew to be offset. But we may gain some insight into
what might on its face seem a rather remarkable phenomenon by examining
the following experiment, in which we drew samples of size 20 and samples
of size 40 from a compilation of 106 large American cities for size of
population. The original distribution (Table 12.1.3) was highly skewed and


Table 12.1.3  Population Distribution, 106 American Cities 100,000 and Over,
by Size of Population

Size of Cities
('000)             f     Per Cent

100-199           56      52.8%
200-299           14      13.2
300-399           11      10.4
400-499            7       6.6
500-599            5       4.7
600-699            2       1.9
700-799            1        .9
800-899            3       2.8
900-999            2       1.9
1,000 or more      5       4.7

                 106     100.0%


Source: U.S. Bureau of the Census, U.S. Census of Population: 1950,
Characteristics of the Population, Vol. II, Part 1, U.S. Summary,
U.S. Government Printing Office, Washington, D.C., 1953, Table 1b.


consequently ideally suited for experimentally testing the effect of a 
skewed population on the sampling distribution of the mean. 
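
An experiment of this kind can be imitated in a few lines. The sketch below is not the authors' procedure: it rebuilds an approximate universe from the class frequencies of Table 12.1.3 (taking the first class as 56 cities, so that the column sums to 106), places every city at its class midpoint, uses an arbitrary 1,500 for the open-ended top class, and draws samples with replacement. All of these are assumptions made for illustration, as are the function names.

```python
import random

# Frequencies from Table 12.1.3. The midpoints are an assumption, and 1,500
# is an arbitrary stand-in for the open-ended "1,000 or more" class.
CLASS_MIDPOINTS = [150, 250, 350, 450, 550, 650, 750, 850, 950, 1500]
FREQUENCIES = [56, 14, 11, 7, 5, 2, 1, 3, 2, 5]
POPULATION = [m for m, f in zip(CLASS_MIDPOINTS, FREQUENCIES) for _ in range(f)]


def skewness(values):
    """Moment coefficient of skewness: third central moment over sigma cubed."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    third = sum((v - mean) ** 3 for v in values) / n
    return third / variance ** 1.5


def sample_means(n, n_samples, seed=0):
    """Means of n_samples random samples of size n, drawn with replacement."""
    rng = random.Random(seed)
    return [sum(rng.choices(POPULATION, k=n)) / n for _ in range(n_samples)]


if __name__ == "__main__":
    print(f"universe skewness:       {skewness(POPULATION):.2f}")
    print(f"means of samples, n=20:  {skewness(sample_means(20, 200)):.2f}")
    print(f"means of samples, n=40:  {skewness(sample_means(40, 100)):.2f}")
```

With only 100 or 200 samples, as in the experiment described, the measured skewness fluctuates from run to run; a larger number of samples shows the decline with sample size more steadily.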

The first set of 200 samples (n = 20) was drawn and the mean of each
sample calculated and tabulated (Table 12.1.4a). This experimental
sampling distribution is still perceptibly skewed to the right, but not
nearly to the same degree as was the parent universe. This reduction
in skewness is a result of the fundamental principle that the means are
averages which necessarily cut down the individual extremes
found in the universe. It follows therefore from the very nature of the
mean that a distribution of sample means can never be as irregular, or
as widely dispersed, as are the individual values which make up the






Figure 12.1.5b  Histogram, Distribution of 100 Sample Means, n = 40

applying normal probability calculations to their data by taking proper 
precautions. 

The convenience of large samples consists in the assurance that the
sampling distribution of the mean is approximately normal, an
assumption which may be in doubt when samples are small. Situations do,
however, often arise when small samples are not only convenient, but
even unavoidable. In such instances, modification of the foregoing
procedures is required, although no new principles are involved. This
adaptation of reliability techniques to small samples is therefore not
considered in this text.

Confidence Interval for a Percentage (Large Sample). Like the mean, the
sample percentage also raises the issue of reliability, since sample
percentages are equally subject to sampling errors. Thus, a given sample
percentage (e.g., 60 per cent favoring military training) may be off a
trifle, or may deviate considerably from the true percentage. Similarly,
30 per cent of a random selection of married women may be gainfully
employed, but the percentage in the sampled universe may conceivably
be as low as 20 or as high as 40. Fifty-one per cent of the sample
electorate may signify its intention of voting for the Republican nominee
in the forthcoming election, but unless we can in some way assess the
accuracy of that estimate we cannot forecast with any degree of
confidence the outcome of the election. Hence, in interpreting a sample
percentage, we must again estimate the average sampling error (standard
error) in order to provide a confidence interval.







samples. Furthermore, the larger the sample, the more successfully will
the dispersion be reduced. Thus, when sampling from a markedly
skewed population, samples even as small as 20 will produce a sampling
distribution displaying evidence of a strain toward normality, although
obviously falling far short of that goal.

Samples of 40 should therefore continue the trend toward normality,
or at least toward symmetry. Such a distribution is tabulated in Table
12.1.4b. Gone now is the long tail; instead, the beginnings of symmetry



Table 12.1.4b  Distribution of 100 Sample Means, n = 40

Mean ('000)        f     Per Cent

100-199            0       0.0%
200-299           10      10.0
300-399           41      41.0
400-499           21      21.0
500-599           17      17.0
600-699            3       3.0
700-799            7       7.0
800-899            1       1.0
900-999            0       0.0
1,000 or more      0       0.0

                 100     100.0%


are clearly visible. Thus, sociologists who are often called upon to deal
with skewed populations may draw considerable comfort from the
important implications of this little experiment and may feel justified in



Figure 12.1.5a  Histogram, Distribution of 200 Sample Means, n = 20






Since intervals constructed in this manner contain the true percentage
95 times out of 100, we may declare that we are confident at the 95 per
cent level that the true percentage is not less than 56.81, nor greater
than 63.19. Since 5 per cent of such intervals may be assumed to exclude
the true percentage, we thus take a 5 per cent risk of being wrong.

The foregoing procedure is another instance of large-sample technique
in that it applies to samples having 30 or more cases. But the
assumption of a normal sampling distribution hinges on the ratio, P:Q, as well
as on sample size. As the P:Q balance departs more and more from
50:50, the sampling distribution becomes increasingly skewed.

As has already been pointed out in the study of the mean, in the case
of percentages, the skew in the P:Q balance communicates itself to the
sampling distribution. Again, as with the mean, this skew can be
circumvented only by a sharp increase in the size of n, so as to instill
normality in the sampling distribution. Table 12.1.5 shows the sample sizes


Table 12.1.5  Minimum n for Selected p-Values for Normal Approximation

  p          n

 .5         30
 .4         50
 .3         80
 .2        200
 .1        600
 .05     1,400

Adapted with permission from William G. Cochran, Sampling Techniques,
p. 41. Copyright, Wiley & Sons.


needed for varying sample percentages in order to permit the
assumption of a normal sampling distribution.

The Finite Population Multiplier. In actual sociological investigations,
we ordinarily encounter finite populations, and yet in the foregoing
instances we have calculated the standard error on the assumption of
sampling from an infinite universe. But this is not quite so unrealistic as
might appear on its face. Finite populations are usually large enough to be
considered as infinite, which is incidentally something of a practical
convenience, since reliability procedures are somewhat less complex for
infinite than for finite populations. For finite populations, the measurement
of reliability must take into account both sample size (n) and the size of
the universe (N), whereas for infinite populations, the computation
of reliability need take into account only sample size, since the size of
the universe is incalculable and hence cannot vary.







The technique of constructing an interval estimate of a universe
percentage is in its essentials no different from that of the mean: we
(1) fix the desired degree of confidence; (2) estimate the standard error;
and (3) attach to the observed sample percentage the multiple of the
standard error needed to attain the agreed-upon confidence. The
sampling theory on which this procedure rests is identical with that
underlying the treatment of large sample means: namely, the distribution of
all possible percentages (p) is approximately normal around the universe
percentage, P, with standard error

$$\sigma_p = \sqrt{\frac{PQ}{n}}$$

where σ_p = standard error of percentage
      P = universe percentage
      Q = 100 - P

But since the parameters P and Q are of course unknown, we again follow
the convention of substituting the sample values p and q in order to
obtain an estimate of the standard error: *

$$s_p = \sqrt{\frac{pq}{n}} \qquad (12.1.6)$$


Computing an Interval Estimate of a Percentage. Let us suppose that out
of 900 randomly selected high school students, 60 per cent respond in
favor of universal military training and 40 per cent are unfavorable.
To form a confidence interval, we must first estimate the standard error:

$$s_p = \sqrt{\frac{(60)(40)}{900}} = 1.63$$

By adding and subtracting this quantity to and from the sample
percentage, we obtain the 68 per cent confidence interval:

    60% ± 1.63 percentage points
    58.37% - 61.63%

The odds are 2 to 1 that the true percentage lies within this interval.
If we should desire greater confidence (say, 95 per cent), we would
attach 1.96 standard errors:

    60 ± 1.96(1.63)
    60 ± 3.19
    56.81% - 63.19%
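
The same computation may be sketched in code. This is an illustration under the large-sample assumptions of the text; the function name `percentage_interval` is ours. Because the sketch carries the standard error unrounded, its limits differ from the hand computation (which rounds to 1.63 first) in the second decimal place.

```python
import math


def percentage_interval(p, n, z=1.96):
    """Return (standard_error, lower, upper), all in percentage points."""
    q = 100.0 - p
    standard_error = math.sqrt(p * q / n)
    return standard_error, p - z * standard_error, p + z * standard_error


if __name__ == "__main__":
    # 60 per cent favorable among 900 students, as in the worked example.
    for z in (1.0, 1.96):
        se, lower, upper = percentage_interval(60.0, 900, z)
        print(f"z = {z:.2f}: standard error {se:.2f}, interval {lower:.2f} to {upper:.2f}")
```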

* As in the case of the mean, a better estimate of the standard error of p is provided
by $\sqrt{pq/(n-1)}$. For a discussion of this point, see William G. Cochran, Sampling
Techniques, John Wiley & Sons, Inc., New York, 1953, pp. 32-33.






gardless of the size of the universe, that has given birth to the paradox
that sample reliability is affected only by the absolute number of cases
and that the sampling ratio is of no consequence.
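
How little the size of the universe matters when the sampling ratio is small can be seen by computing the finite population multiplier directly. The sketch below assumes that the multiplier enters the standard error under a square root, as the text's figure of .86 for a 25 per cent sample indicates; the function name is ours.

```python
import math


def finite_population_multiplier(n, N):
    """Factor sqrt(1 - n/N) by which the infinite-population standard error shrinks."""
    return math.sqrt(1.0 - n / N)


if __name__ == "__main__":
    # A 25 per cent sample, as in the census illustration.
    print(f"25 per cent sample: {finite_population_multiplier(25, 100):.3f}")
    # A sample of 200 cases drawn from universes of very different size.
    for N in (10_000, 1_000_000, 180_000_000):
        print(f"n = 200, N = {N:>11,}: {finite_population_multiplier(200, N):.4f}")
```

For the sample of 200 the multiplier stays within about one per cent of 1.00 in every case, which is why it is commonly disregarded when the sampling ratio is small.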

Since the sampling ratio is often relatively small, as in public opinion
surveys, the finite population multiplier is correspondingly ignored,
which explains its textual omission in some of the briefer discussions of
sampling. However, it should be noted that, whenever the universe is
extremely small, as a small community, the sampling ratio must
necessarily be large, otherwise the number of cases in the sample would not
be adequate to represent the sampled universe. Thus, the U.S. Bureau
of the Census has seen fit to draw a 25 per cent national sample in order
to ensure adequate coverage in the very small enumeration areas. A
much lower sampling ratio would of course be sufficient for national,
state, and metropolitan coverage, but it would produce an insufficient
number of sampling units from the numerous tiny subdistricts.


Some Persisting Issues in Applied Sampling

The complete process of estimation as exemplified in the preceding
discussion is easily reducible to a few abstract stages: (1) the drawing
of the sample, (2) the computation of the sample statistic, and (3) the
formulation of the inference concerning the parameter.

But sampling is not an abstraction; it is not carried out in a vacuum. 
It is performed on concrete things and events: for example, a campus 
of students, a carload of corn, a community of households, a set of opin-
ions on public issues, or almost any conceivable set of events about which 
there is some scientific curiosity. Being an operation executed in mate- 
rial situations, its validity is conditioned by the nature of the data it works 
on. What are the characteristics of the social data which the sampler 
would have to take cognizance of, in order to insure sound statistical 
reasoning? 


The Instability of the Social Universe. The stages of the sampling
procedure sketched above seem quite uncomplicated when applied to a bolt
of cloth, batches of chemicals, a shipment of light bulbs, or the leaves of
a mimosa tree. The inferences are made, in general, as quickly as the
data can be processed, and the universe is not subject to disturbing
accretions, deletions, or deterioration. However, such uneventful
simplicity is not descriptive of the social world, which is much more complex
and dynamic. Social investigations are frequently of long duration;
the universe, the elements of which are very mobile, is often subject to
alteration by growth as well as diminution, and its significant traits are
subject to modification while under study, indeed, partly as a result
of their being studied. Thus, public opinion is subject to vacillation,






This principle is in agreement with common sense: a sizable fraction
of a finite universe should be more representative than a negligible
fraction thereof. A 100 per cent sample would be most dependable,
since in such cases there can be no sampling error at all. The standard
error would be zero. Analogously, we expect a 50 per cent sample to be
more reliable than a 25 per cent sample, which in turn we expect to yield
more confidence than a 5 per cent sample. In general, relatively large
samples are more representative than small ones. The measurement of
this improvement in reliability is effected by incorporating the sampling
ratio, n/N, in the formula for the standard error. Thus, the standard
error of the mean, on the assumption of a finite population, is

$$\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} \sqrt{1 - \frac{n}{N}}$$

The term, 1 - n/N, or the proportion of the population not in the sample,
is frequently referred to as the finite population multiplier,* because it
measures the improvement in reliability attributable to the finite nature
of the universe.

It is clear that the quantity, 1 - n/N, will always be less than 1.00;
hence, it will always produce a shrinkage in the standard error below
that for the infinite population. When the sampling ratio is large, it will
even lead to a substantial reduction in the size of the standard error.
Thus, with a 25 per cent sample, such as that selected by the U.S. Bureau
of the Census in 1960, employment of this term would lead to a
reduction in the standard error as follows:

$$\sqrt{1 - \frac{n}{N}} = \sqrt{1 - .25} = \sqrt{.75} = .86$$


Eighty-six per cent of the initial SE constitutes a 14 per cent shrinkage.
Hence, it would here be imperative to employ the multiplier, otherwise
we would fail to do justice to the accuracy that actually resided in the
sample data. On the other hand, when the sampling ratio is small, say
less than 5 per cent, the finite multiplier has little or no effect on the
magnitude of the standard error and may be disregarded. Thus, in a
sample of 100-200 cases it makes little difference whether the universe
contains 10,000, 1,000,000, or even 180,000,000 units. It is this fact
that several hundred sample observations may be equally reliable, re-


* More commonly referred to as the finite population corrector. However, this term is
somewhat misleading. The factor, 1 - n/N, does not serve to “correct” a faulty result;
rather it is an integral component in the standard error formula for finite populations.
When the sampling ratio is small, this component may be sloughed off, whereupon
standard errors for finite and infinite populations become identical. If the standard
error had been given originally for finite populations, then it would have been
necessary to “correct” for the infinite population!






example, the further use of the sample touches a population which did
not have “an equal chance of being selected.” Such prolonged use of a
sample beyond its normal life is analogous to the retention of outmoded
clothing or dilapidated household utensils: they may still serve their
purposes, but not too well. Anyway, we cannot discard [...]
we cannot afford new ones. In view of this dilemma, we are obliged to
coin new terms and supplementary concepts which do not cohere [...]
with the ideal simple random procedure: the “target universe,” the
“hypothetical universe,” and the “availability sample” which [...]
the universe which did not originally have an opportunity to contribute
to the sample.

The lesson we can derive from contemplating the conditions
described is twofold: (1) the interpretive inferences in statistics
necessarily take on a less rigorous character and must be made with
caution and reserve, and (2) the requirement of [...]
subject matter is all the more insistent, without which [...]
sophistication and skill cannot be brought to bear on the application of
quantitative reasoning.

Size of Sample. The inevitable question that arises early in [...]
of investigation is that of the size of sample. [...]
in preference to the entire universe, is of course [...]
is no economy in a sample that is larger than necessary, [...]
foolish to be content with a sample that is too small [...]
requisite accuracy. The importance of this problem is [...]
greater than one would deduce from the meager discussions [...]
elementary texts. And yet the question is bound [...]
simplest sampling studies. The techniques appropriate [...]
are mathematically advanced statistics. However, some [...]
principles are at hand which will serve to pilot the student who is not equipped
[...]. Incredible as it may seem, a [...] sample may
often produce dependable results. [...]
have successfully predicted the outcome of [...] population of
[...]. How is that observation to be reconciled with the foreboding description of
the unstable social world?

The most immediate explanation of this apparent [...]
even the social world may be viewed at times as [...]
political poll in a system comprising only two parties, [...]
voters are sampled according to well-established criteria, ought to
yield highly reliable results, when there is no unforeseen [...] of
[...] circumstances. [...]
sample size, we should visualize the supremely [...]
of a perfectly homogeneous universe which, of course, could be sampled





and some of the samplings of the decennial census are touched with
obsolescence almost before they are circulated for use. The bespectacled
scholar stands contemplating his sample while the social universe which
it supposedly represents is moving on. Preelection polls are therefore
taken at short intervals to eliminate the pitfalls of opinion shifts; but
the decennial censuses are too cumbersome and costly to replicate at
useful intervals. Hence, we sometimes find ourselves helplessly falling
back on an anachronistic population base in our sociological studies.

This dilemma is aggravated by the fact that, in the social studies,
the unit of time over which the social process endures is usually much
longer than it is in the studies of the physical world. A meteorologist,
who would not predict the weather more than a few hours in advance,
expects an economist to forecast depression and the sociologist to
predict the course of a delinquent career and to foresee recidivism.

The Heterogeneity of the Social Universe. The instability of the social
world, as previously described, constitutes heterogeneity in the
chronological dimension: divorces, crimes, and marriages are not at all
identical with those events, which are nominally the same, of a decade
previously. Public opinion today is not what it was “before the last
diplomatic note from Russia.”

But heterogeneity also exists in the lateral dimension. The
superorganic world, with its variety of personality and cultural traits, does
not offer such a homogeneous base for reliable sampling. Subcultural
areas within the metropolitan community display such cultural diversity
as to render sample designs complex and difficult to execute. The
Elmtowns, Yankee Cities, and Middletowns, with all their presumed
typicality, cannot be cavalierly employed as launching pads for inferences
about other Elmtowns, Yankee Cities, and Middletowns, even though
the economy of sociological sampling enterprises sometimes compels us
to do so.

Sociologists might well envy the students of the inanimate world or
even the botanical and animal kingdom, who preside over a relatively
homogeneous universe: a carload of wheat or light bulbs, a sample of
dogs or mice in the laboratory. But such is not the fortune of the social
scientist. Hence, as an economy measure, we are often obliged to employ
obsolete social samples that should have been cast aside for more current
samplings. They are still more often applied to “target” populations,
which were not even included as components in the original universe
from which the sample was selected.

As an obvious consequence of the practical considerations recited above,
the ideal model of random sampling often becomes impossible to carry
through. Even if it is practiced in a given study, in the census, for




sample is impracticable, as in national opinion polling, stratification may
perhaps be utilized in order to reduce it.
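
The bearing of the expected split on the required size of sample can be made concrete with the standard error of a percentage, the square root of pq/n. The sketch below is illustrative only: the target standard error of 2 percentage points is a hypothetical choice, and the function names are ours.

```python
import math


def percentage_standard_error(p, n):
    """Standard error, in percentage points, of a sample percentage p."""
    return math.sqrt(p * (100.0 - p) / n)


def sample_size_for(p, target_se):
    """Smallest n whose standard error does not exceed target_se percentage points."""
    return math.ceil(p * (100.0 - p) / target_se ** 2)


if __name__ == "__main__":
    # Same sample size, different splits: the even split has the larger standard error.
    print(percentage_standard_error(50.0, 400), percentage_standard_error(80.0, 400))
    # Same target reliability, different splits: the even split demands the larger sample.
    print(sample_size_for(50.0, 2.0), sample_size_for(80.0, 2.0))
```

Under these assumptions an expected 50-50 division requires 625 cases to reach a standard error of 2 points, where an 80-20 division requires 400.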

Work to Be Performed by the Sample. The objective of any sample is to
represent the unknown universe with a certain desired precision. But it
can perform this task either well or badly. The principal statistical
criterion of a well-behaved sample is, of course, the size of the standard
error; all other things being equal, a larger sample will produce a smaller
standard error, a smaller sample will produce a larger average sampling
error. It should be unnecessary to reiterate that this principle will
generally hold only with random samples; for non-random biased samples,
there is no safety in numbers, as the calamitous experience of the Literary
Digest poll of 1936 has amply testified. Like a fleet athlete running in
the wrong direction and reaching the wrong goal so much the sooner, an
increase in the size of a biased sample can lead only to an all the more
grievous blunder.

In the second place, if the sample items are to be subdivided into
smaller categories, the sample obviously has to be large enough to permit
such dismemberment without incurring the hazards of vacant or sparsely
populated cells. If some of these categories show sparse [...]
reflecting a very unbalanced trait distribution in the [...] ([...]
delinquents, left-handed persons, Negro physicians), the over-all sample
will necessarily have to be adequate to catch such [...].
Furthermore, the problem of size of sample would be much less
formidable if the task of a sample were confined to one [...]
is rarely the case. We know that a sample can be a very costly
instrument. Like any other investment, it is economical to make it do the
work for as many purposes as possible. Thus, a given sample of several
hundred household units may be used to [...]
religion, size of family, unemployment, or any other variable [...]
scope of the surveyor’s scientific curiosity. Not only [...]
employed in this multi-purpose sense, but [...]
many subsidiary characteristics which may be included [...]
information in the schedule. The importance of [...]
tasks is that a sample of a given size will not be equally efficient for all
variables under study. For the simple dichotomy of sex, [...] sample
may be representative enough, whereas for school [...]
[...], and many other more complex distributions, the same [...]
will be much less representative. Even the most elementary surveyor will
be called upon to take into consideration the wide differences in patterns
of distribution.

Because traits are not equally variable, a sample that satisfies the
demands of a two-value trait will probably not be [...]
more complex distribution. If such multi-dimensional tasks [...]






by a single case. Such a sample cannot be spoiled by even the most
arbitrary selection. This is the reason why many an otherwise careless
and inadequate poll does sometimes succeed in doing its work well and
is bound to make even an incompetent surveyor look good.

On the other extreme, a hypothetical aggregate of 1,000 totally unique
objects could not be successfully sampled at all. Not even the most
scrupulous randomization, nor any enlargement of sample size, would
ameliorate this impossible circumstance. In practice, of course, neither
one of these abstractly perfect extremes is ever encountered. Our
populations always fall somewhere on the long continuum between these
theoretical limits.

Consequently, the question cannot be so simply put as: “What size
sample?” In such an oversimplified form, it can be answered only
evasively: “It all depends.” On what does it depend? It depends on
three interlocking circumstances: (1) the presumable characteristics of
the universe which the sample is designed to represent faithfully; (2) the
work required of the sample in performing its function; and (3)
the material resources at the disposal of the investigators.

This last named consideration is, of course, extraneous to the question
of statistical principles. Nevertheless, the cost per case is an important
budgetary issue in all research and will be a final determinant in the
compromise between the optimum sample that we would like and the
practical sample that we can afford. Beyond this acknowledgment of
the problem of cost accounting in sampling, this issue will not be further
discussed.

Characteristics of the Universe. Although, in theory, the characteristics
of the universe are unknown, in actuality a seasoned worker knows quite
a lot about the universe that he is about to study for a specific trait.
Since it is an exaggeration to state that the universe or a parameter is
literally “unknown,” the general characteristics are indeed taken into
account by the statistical worker.

Thus, the more heterogeneous the universe is judged to be, the larger
the sample should be. If there were no variation, there would be no
necessity to sample more than one item. By corollary, a wide range of
variation in quantitative data requires a correspondingly larger sample
to assure representativeness. The same principle would obtain in the
case of qualitative variables. As has already been demonstrated (Chapter
7), an even division in qualitative variables yields maximum
heterogeneity, which is measurable by the standard error of a proportion: an 80-20
split would give a smaller standard error than a 50-50 division. Thus,
to assure a given reliability on a Yes-No vote, an expectation of an
approximately even split of votes would demand a larger sample than
would an 80-20 division. If, for any reason, a large simple random




4. (a) For a given population, detennine how sample sire would have to be ad- 

justed in order to reduce the standard error of the mean to: 
one-half its original sire 
one-fourth its original 
one-sixteenth its original dxe 
(Hint' Substitute in SE formula and solve.) 

(b) Explain how these results iDustmte the statement that the reliability of 
the mean varies directly with sample oze. 

5. A given sample has the following characteristics: 

X̄ = 11
s = 3
n = 100

If the population mean is known to be 12, what is the sampling error of the
observed mean? What is the estimated standard error of the mean? How
often would you expect sampling errors larger than ±.5? ±1.00? (Hint:
Find the required z-measure and refer to the table of normal areas.)

6. Explain the statement that the sampling distribution of the mean is the
equivalent of the distribution of sampling errors.

7. Explain in your own words why the sampling distribution of the percentage
will be lacking in symmetry when n is small and P:Q very unbalanced, say,
90:10. (Suggestion: Draw an appropriate graph with the percentage scale on
the base line.)

8. Suppose that a complete set of attitude scores has a mean of 1,000 and a 
standard deviation of 200. 

(a) If 25 scores are picked at random, what is the probability that their average 
score will be less than 950? 

(b) Greater than 1,100?

(c) Between 900 and 1,100?

(Hint: Obtain the SE and z, and consult table of normal areas.)

9. Discuss the statement: "The wider the interval estimate, the greater the
degree of reliability; the greater the degree of precision, the less reliable the
interval estimate."

10. (a) State in what sense the standard error is an average of all possible
sampling errors.

(b) Will an analogous statement hold for the standard deviation? 

11. Assume a sample survey of a college student body on the following
characteristics: (1) proportion who read the newspapers regularly; (2) attitude
toward segregation; (3) opinion on rules for women's residence halls;
(4) proportion who receive A grades; (5) proportion who attend church regularly;
(6) ... objectives; (7) social class of parents; (8) proportion gainfully
employed and their earnings. Would a single sample serve to represent all of the
above characteristics equally well? Why or why not?





Statistical Inference


for a given sample, the sample has to be so designed to meet the peak
requirement of the most demanding individual piece of investigation.
Such an all-purpose sample, for instance, is supplied by the U.S. Census.
The sample size of 25 per cent, set for the 1960 enumeration, is large
enough to satisfy any sampling need that can be anticipated, from the
smaller town to the giant metropolis.

The statistical sample thus constitutes a major road to quantified
knowledge. If it never gives us certainty, but only probabilities, it shares
this probabilistic quality with knowledge of every type. It simply is
not given to man to be completely ignorant, any more than he is omniscient
or omnipotent. This is another way of saying that statistics is
a truly human science, a tool in the pursuit of knowledge that will
never terminate.

Questions and Problems 

1. Define the following concepts: 

Statistical Inference 

Estimation 

Parameter 

Statistic 

Sampling Error 

Sampling Distribution

Confidence Interval 

Standard Error of the Mean 

Standard Error of the Percentage 

Interval Estimate

Point Estimate 

Large Sample 

Finite Population Multiplier

2. Using the table of random digits (Table IV, Appendix), draw two samples of
30 cases each from the list of 107 suicide rates (Table 3.1.1a).

(a) Compute the mean and the standard deviation of each sample. 

(b) Compute the sampling error of each sample mean on the basis of the known 
true mean. 

(c) As a class project, arrange the means from (a) above in a frequency table.
Compare this distribution with the empirical distribution presented in the
text. Comment on the difference. 

3. (a) Estimate the standard error of the mean from each sample of 30 cases 

from the preceding exercise. 

(b) Explain why the estimated standard errors are not identical in value.

(c) Establish 95 per cent confidence intervals by adding to and subtracting
from each corresponding sample mean 1.96 standard errors.

(d) As a class project, determine what percentage of the 95 per cent confidence
intervals contain the true mean.





§ 12.2 Testing a Statistical Hypothesis

then, does the null hypothesis differ from any other hypothesis (from
a hypothesis, for example, stating that juvenile delinquency is associated
with status discontent, or that the incidence of suicide depends on the
degree of social disorganization)? These hypotheses we may call
explanatory hypotheses, whereas the null hypothesis is an auxiliary device
which tentatively denies the validity of the explanatory theory. Hence,
if we reject the null hypothesis, we strengthen the credibility of our
explanatory hypothesis. But the null hypothesis itself has no explanatory
value.

Yet science is in search of relationships. Hence, in accordance with 
the above reasoning, the null hypothesis is usually launched with the 
expectation, indeed with the hope, that it will be nullified, as the deri- 
vation of the term implies. Nevertheless, it represents a possibility 
that must be disposed of before alternative hypotheses which imply
assignable causes can be considered. 

Returning to the comparison of eastern and western cities, we would 
set up such a comparison in the first place only in order to uncover some 
of the factors that determine the incidence of suicide. If we do discover 
a significant difference between the average rates of East and West and 
consequently reject the null hypothesis, we will have added a small
increment to our knowledge of suicide, namely, the regional factor as
a source of variation in the suicide rate. If, on the other hand, the two 
samples of data show no significant difference and we accept the null 
hypothesis, then we have not advanced our understanding, although 
supposedly we will not have retrogressed. In this latter case, we would 
have to contrive new kinds of comparisons in our search for the assign- 
able causes of suicide. 

Thus, it is only when the preliminary hypothesis of “no difference” 
has been cleared away that we gain some insight into the occurrence of 
suicide. Hence, the null hypothesis is inherently linked with a more 
constructive statistical hypothesis, sometimes called an alternative
hypothesis, which becomes tenable to the extent that the null hypothesis has
been discredited. Such an alternative hypothesis may specify an exact 
degree of difference between East and West, with a view to gauging the 
strength of the unknown variables which produce suicide in a disorganized 
society. Clearly, there is no point in pursuing the effects of a given control 
or experimental variable such as geographical region if we have found in 
favor of the null hypothesis. Thus, by analogy, the null hypothesis serves
the purpose of a criminal trial: we set up the hypothesis of innocence,
giving the evidence an opportunity to nullify it. Only if and when that
presumption is rejected does the court give thought to alternative
punishments corresponding to the degree of guilt.

In accordance with the foregoing principle, the null hypothesis has
come to be predominantly identified with two types of research pro- 



Section Two 

Testing a Statistical Hypothesis 

Hypothesis-Testing and Estimation Compared. There are two general
types of statistical inference: (1) estimation, which begins without any
stated assumption about the value of the parameter and merely seeks
to estimate descriptively what the parameter could be; and (2) hypothesis-
testing, which begins with a hypothesis about the parameter and then uses
the sample data to check the plausibility of that statement.

In the previous section, we were concerned with problems of estima- 
tion. We began, for example, with the observation of a sample mean, 
and from this we derived an interval estimate with a specified degree 
of confidence. We first drew our sample and then made our estimate 
of the parameter. 

But in hypothesis-testing, we formulate our hypothesis about a parameter
in advance of the collection of the sample data, which is then used
to test that hypothesis. We may, for example, hypothesize that the average
suicide rate of eastern cities does not differ from that of western cities.
We begin with that supposition, and then we take an appropriate sample
of eastern and western cities, compare the two means in the prescribed 
manner, and finally reach a decision whether, in the light of the sample 
difference, the hypothesis should be accepted or rejected. 

Although an interval estimate is always derived from the previously
compiled sample data, a statistical hypothesis involves quite another
analytical process. This process starts with an antecedent conjecture
about an unknown population value, presumably arrived at without
benefit of the undrawn sample. Furthermore, a hypothesis is tested and
then acted upon: it is either accepted or rejected. It requires a decision,
which is made with a certain degree of confidence, and which is either
correct or incorrect after it has been made. The emphasis thus shifts
from mere estimation to decision-making.

The Null Hypothesis (H0). It has become a convention in statistical
testing to open the investigation with the null hypothesis, symbolized
H0. In its most current usage, this hypothesis holds that two or more
given samples have come from statistically identical populations and that
therefore any observed difference between such samples is merely a
chance variation. The aforesaid hypothesis that East and West do not
differ in their average suicide rates would therefore be a typical null
hypothesis.

Essentially, the null hypothesis is set up to be nullified; however,
every other type of hypothesis is also set up for that possibility. How,


ative of an actual difference between the means of the sampled popu- 
lations, or whether it could more plausibly be accepted as mere sampling 
variation. By rule of thumb, we lay down the null hypothesis that the 
population means of East and West are identical and subject this
hypothesis to the statistical test. If we reject this hypothesis, we then proceed
as though there were a real difference, whereas acceptance presumably
implies that the two populations are alike and that the observed
difference is a mere sampling error.

An important principle of our testing procedure is that we can never
prove the null hypothesis true. The best possible evidence that we could
ever obtain for the identity of the two population means would be an 
identity between the two sample means. But even if samples of East 
and West were to show identical averages, we still could not be positive 
that the null hypothesis was true. Such an identity between sample 
means could itself very well be the result of sampling errors, because
population means could differ by a sizable amount and the sample differ- 
ence still be zero. Similarly, the finding that a sample of married spouses 
are of the same average age would not prove that all married couples 
show equal averages. From sample observations, the conclusions we draw 
about the truth of the null hypothesis are necessarily of a probabilistic 
nature. 

If we can never prove the null hypothesis true, may we prove it
definitely false? Here the statistical evidence may be more convincing.
The best possible evidence that the two populations differ would be a
difference between samples. A small difference would not be very
compelling; however, as this difference becomes larger and larger, the case
for a population difference becomes stronger and stronger. If the
difference between samples becomes so large that it is highly improbable
that they stem from the same universe, then we may with practical
assurance reject the null hypothesis. Even here, however, there is no
infallible certainty, since the one case in a million, the freak event,
may happen. Highly improbable events are regularly occurring, however
startled we are when they befall us. But such an extreme case,
the one in a million, is so improbable that most of us would discount
it by regarding it as impossible, thereby rejecting the null hypothesis
with great confidence. In other words, we are willing to reject the null
hypothesis when the statistical probabilities of being wrong are small
enough to suit our purposes.

These decisions — whether or not to reject the null hypothesis — are 
accordingly made with varying degrees of confidence. This confidence 
varies according to the probability that a difference at least as large as 
that observed could have been obtained by chance, assuming the truth 
of the null hypothesis. We must therefore now turn to the procedure
by which such a probability is determined. 




cedures: (1) the comparison of two or more populations on a given trait
and (2) the correlation between two or more traits in a given population.
In the first type, the null hypothesis posits no difference between
population parameters; in the second type, it asserts a chance relation, or
zero correlation, between the variables under study.

Principles of Testing the Null Hypothesis. Let us now suppose a random
sample of 30 eastern cities and a comparable sample of 30 western cities 
whose mean suicide rates are 10.4 and 14.3, respectively (Table 12.2.1). 
We wish to know whether the observed difference of 3.9 could be indic-


Table 12.2.1 Suicide Rates, Eastern and Western Cities





value is sufficiently extreme, or improbable, to justify the rejection of
the null hypothesis. To put it quantitatively: how many times out of 
100 could we expect to obtain this, or a larger, sample value if the actual 
difference is zero? 

To obtain this Pr-value, we simply convert the observed difference
into a standard deviate, or z-measure, and consult the table of normal
areas. Since the mean of the sampling distribution is by null hypothesis
zero, we need only divide the estimated standard error into the observed
difference to obtain z, which tells us how many sigma units our
observed difference lies from the hypothetical mean of zero. In symbols:


$$z = \frac{(\bar{X}_1 - \bar{X}_2) - 0}{s_D} = \frac{D}{s_D} \qquad (12.2.3)$$


This z-measure is generally called the significance, or critical, ratio, since
its magnitude determines whether we judge the observed difference to
be statistically significant, that is, whether the probability of the
critical ratio is small enough to reject H0.

To illustrate the calculation of such a critical ratio, we process the 
data of Table 12.2.1, in accordance with the requirements of Formula 
12.2.3. 



(1) We compute the difference between sample means, attaching no
sign, since our interest lies in the absolute magnitude of the difference
irrespective of direction.

$$D = |10.4 - 14.3| = 3.9$$

(2) Compute the standard error of the difference:

$$s_D = \sqrt{1.67} = 1.3$$





The Sampling Distribution of the Difference Between Means. In the chosen
example, this procedure rests on the sampling distribution of the
difference between two sample means, which here functions as did the
sampling distribution of the mean in estimation. To define and illustrate
this concept, we may imagine a sampling experiment in which we
alternately sample from two well-defined universes, selecting a large random
sample of size $n_1$ from one population, and then a large sample of size
$n_2$ from the other. By replacing samples, we may continue to draw
samples in this manner indefinitely. For each pair of independently selected
samples we compute the difference between the two sample means,
amassing in this way as many differences as there are pairs of samples. We
now pose the problem: how will these hypothetical differences between
paired means distribute themselves, and what will their mean and
standard error be? According to sampling theory, an infinite supply of such
differences, labeled the sampling distribution of the difference, will have
an approximately normal distribution, with its mean equal to the true
difference between the universe means, and standard error expressed as
follows:


$$\sigma_D = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} \qquad (12.2.2)$$

Thus, the standard error of the difference between sample means, 
like the standard error of the mean itself, reflects both the degree of
variation within the sampled populations and the respective sample 
sizes. The smaller the population variances, and the larger the sample 
n’s, the smaller will be the standard error of the difference. 
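
The sampling experiment described above can be imitated on a small scale. The following Python sketch is only an illustration under stated assumptions: the two universes are taken to be normal with hypothetical means of 10 and 14 and a common standard deviation of 5 (these are not the book's data), and the standard error is computed from the usual expression for independent samples, the square root of the sum of each variance divided by its sample size. The function names are invented for the example.

```python
import math
import random


def se_difference(var1, n1, var2, n2):
    """Standard error of the difference between two independent sample means."""
    return math.sqrt(var1 / n1 + var2 / n2)


def simulate_differences(mu1, sigma1, n1, mu2, sigma2, n2, pairs, seed=0):
    """Draw `pairs` pairs of independent samples; return the differences of their means."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(pairs):
        mean1 = sum(rng.gauss(mu1, sigma1) for _ in range(n1)) / n1
        mean2 = sum(rng.gauss(mu2, sigma2) for _ in range(n2)) / n2
        diffs.append(mean1 - mean2)
    return diffs


# Hypothetical universes (assumed values, not taken from Table 12.2.1).
diffs = simulate_differences(10.0, 5.0, 30, 14.0, 5.0, 30, pairs=2000)
mean_diff = sum(diffs) / len(diffs)
sd_diff = math.sqrt(sum((d - mean_diff) ** 2 for d in diffs) / len(diffs))
theoretical_se = se_difference(25.0, 30, 25.0, 30)

print(round(theoretical_se, 2))  # 1.29
print(round(mean_diff, 2), round(sd_diff, 2))
```

With a few thousand pairs, the mean of the simulated differences should settle near the true difference of -4, and their standard deviation near the theoretical standard error; halving the variances or enlarging the samples shrinks both the theoretical and the simulated figure.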

Testing Procedure. In testing the null hypothesis, the observed differ- 
ence is treated as a single value in a normal sampling distribution whose 
mean is zero (indicating no difference between the population means),
and whose standard error is estimated according to Formula 12.2.2, 
above. 

We should recall that the sampling distribution gives us the range of 
all possible sample differences. Therefore, all sampling values within
that theoretical range are possible, but the extreme values in the tail
of that distribution are, of course, much less probable. Within this
range, we now wish to find the probable location of the single difference 
which we have observed, so that we may decide whether this sample 





The estimated standard error will then be:






a purely random manner. Still more confidently would we reject the 
null hypothesis of no difference between East and West, since the
Pr-value is only .0027, and tacitly accept an alternative of some difference.
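
The True-False illustration of nine correct answers out of ten can be checked directly from the binomial distribution. The sketch below assumes that a guessing student has an even chance on each question and that the ten questions are independent; the function name is hypothetical.

```python
from math import comb


def prob_at_least(k, n, p=0.5):
    """Probability of k or more successes in n independent trials with success chance p."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Nine or ten correct answers out of ten by pure guessing.
p_guess = prob_at_least(9, 10)
print(round(p_guess, 4))  # 0.0107
```

The result, 11 chances in 1,024, is the figure of roughly .01 at which most teachers would decline to believe that the student had merely guessed.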

In making his decision, the worker could set up any level of
significance which he thought prudent as the threshold at which to reject the
null hypothesis. This means that he could set up any degree of risk which
he is willing to assume: .001, .01, .05, .10, or even .50, according to the
circumstances. But the agony of making a fresh selection of the level
of significance in each instance would be a painful business. Hence, the
statistical worker finds welcome relief in the .05 convention, which
prescribes that the null hypothesis is automatically to be rejected whenever
the probability of being wrong in that decision is 5 per cent or less.

We may graphically illustrate the foregoing argument by the theoretical
sampling curve, which has, under the assumption of the null
hypothesis, a mean of zero. The shaded rejection zones are located in
the tails of the normal curve, and are bounded by ordinates erected at
the chosen significance level. By a decision arrived at prior to the
sampling itself, the null hypothesis is rejected whenever the sample
observation falls in the rejection zone; otherwise it is accepted.

Figure 12.2.1 Sampling Distribution of Difference, Mean = 0,
.95 Acceptance Zone and .025 Rejection Zones
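
The boundaries of the rejection zones can be located numerically. The following sketch assumes a two-tailed test on a normal sampling distribution and uses Python's standard `statistics.NormalDist`; the function names `critical_value` and `decide` are invented for the illustration.

```python
from statistics import NormalDist


def critical_value(alpha):
    """Two-tailed critical ratio: the z that leaves alpha/2 in each tail."""
    return NormalDist().inv_cdf(1 - alpha / 2)


def decide(z, alpha=0.05):
    """Apply the rule fixed before sampling: reject when z falls in a rejection zone."""
    return "reject" if abs(z) >= critical_value(alpha) else "accept"


print(round(critical_value(0.05), 2))  # 1.96
print(round(critical_value(0.01), 2))  # 2.58
print(decide(3.0), decide(1.2))        # reject accept
```

Because the level is chosen in advance, the decision reduces to a comparison of the observed critical ratio with a fixed ordinate.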

Now, whenever we reject the null hypothesis, we tacitly accept some
unspecified alternative hypothesis which has its own sampling
distribution. Although any observed sample value is necessarily consistent
with the null hypothesis, it is also consistent with innumerable
alternatives, any one of which when tested may be more acceptable than the
null hypothesis. The truth of this assertion is readily demonstrated by
Figure 12.2.2. The farther our observed difference is from the zero mean
of the null distribution, the nearer it necessarily is to some alternative
which is more likely to be true. We may therefore pose the dilemma:



(3) Express the observed difference as a standard measure, which is
the sigma distance between our observed difference and the assumed
mean of zero:

$$z = \frac{D}{s_D} = \frac{3.9}{1.3} = 3.0$$

(4) Entering the table of normal probabilities (Table I, Appendix)
with this z-measure, we find that .9973 (99.73 per cent) of the values in
a normal frequency distribution lie between the mean and ±3 sigmas.
Therefore, only .0027 (27 out of 10,000) of all possible sample differences
will be larger than the one obtained, assuming the truth of the null
hypothesis.
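
The computation of the critical ratio and its probability can be sketched in a few lines of Python. The figures are those of the text (sample means of 10.4 and 14.3, estimated standard error of 1.3); the function names are hypothetical, and the complementary error function is used here in place of the printed table of normal areas.

```python
import math


def critical_ratio(mean1, mean2, se_diff):
    """Observed difference, without sign, expressed in standard-error units."""
    return abs(mean1 - mean2) / se_diff


def two_tailed_probability(z):
    """Normal area lying beyond -z and +z combined."""
    return math.erfc(abs(z) / math.sqrt(2))


z = critical_ratio(10.4, 14.3, 1.3)
p = two_tailed_probability(z)
print(round(z, 1), round(p, 4))  # 3.0 0.0027
```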

Decision-Making. We must now decide on the basis of this Pr-value
(.0027) whether or not to reject the null hypothesis. If, on the
incomplete evidence of the sample, we too glibly accept the assumption of no
difference between the universe values, we run the risk of overlooking a
genuine difference. If, on the other hand, we reject the assumption of
H0 too hastily, we run the risk of inferring a significant difference where
none may actually exist. How, then, do we decide?

There are two related steps in the process of statistical decision-making
which must be differentiated: (1) an evaluation of the obtained
probability, and (2) an assessment of the consequences of a wrong decision.
In the above calculation, the observed difference between East and West
and larger ones could have occurred by chance only 27 times out of 10,000
samplings on the basis of the null hypothesis. Although this is a rather
rare event, it is still not inconsistent with the null hypothesis. And
since it is consistent with the null hypothesis, we necessarily run the
risk of being wrong 27 times out of 10,000 if we rejected that theory.
But when the probability drops to such a low level, we are wont to
regard the observed difference as statistically significant rather than as a
mere freak of sampling. We therefore state that we "reject the null
hypothesis at the .0027 level of significance," since the probability of
being wrong is so trifling.

It is somewhat analogous to a situation in which a student marks nine
out of ten True-False questions correctly. With only one mistake out
of a possible ten, could we assume that the student has merely guessed?
Although this outcome would be possible by chance, the probability is
not very high, approximately .01. At this .01 significance level,
most teachers would intuitively tend to reject the hypothesis that the
student was totally ignorant, and that he had marked the answers in

rather embrace? A liberal, humane teacher might prefer to run one type 
of risk, a rigid pedant would accept the other. The former would be
disturbed by the possibility of failing a good student; the latter might 
shudder at the possibility of passing a poor one. Thus, two different 
observers might very well come to opposite decisions on identical sta- 
tistical evidence because the cost of an incorrect decision will not weigh 
equally on each decision-maker. In fact, when there is “everything to 
gain and little or nothing to lose," or "everything to lose and nothing
to gain,” the statistical odds may even be completely ignored. 

Such dilemmas are universal in everyday affairs. What are, for
example, the consequences of condemning an innocent man, or releasing
a guilty one? In this case, the moral standard of the culture usually 
dictates the decision. According to contemporary Anglo-Saxon ethics, 
it would be better to free a guilty man (accept false H) than run the risk 
of condemning an innocent one (reject true H). In some other cultures, 
the potential evil of allowing freedom to a heretic has been considered 
so great that the lesser evil has been to condemn the faithful innocent.
The frustrated motorist cannot run the risk of parking alongside a fire- 
plug which has not been used for 50 years because of the dire consequences 
of being wrong — that is, the expected loss in the remote case of fire. 

Again, the probability of on-coming traffic around a blind curve may 
be very small; nevertheless, the prudent driver prefers the loss of a few
minutes of time to the risk of life and limb; for the consequences of being 
wrong, however slight the probability, are so grave. However, in an 
emergency, even such risks are accepted, many times quite successfully. 

From the foregoing analysis, it becomes evident that statistics is able
to measure the risk of being wrong, but it cannot advise an interested
person whether or not to accept that risk. Whether one accepts the measured
risk will depend on considerations that are subjective, personal, ethical,
and economic. Ironically enough, a statistical decision is thus grounded
on non-statistical considerations. A penniless vagabond would avoid a
given financial risk that would not deter a plunger with money in the 
bank; a robust man who abhors overshoes, raincoats, and umbrellas 
would accept the risk of rain which a consumptive would not dare to run. 

There are, of course, many decisions which do not seem to matter 
either way, even when later events have proved them wrong. Either 
the costs of being wrong do not differ in seriousness, in which case the 
decision-maker may have a difficult dilemma on his hands; or the
penalties of being wrong may not be too grievous. For example, in purely
academic studies, such as whether there is a difference in personality
scores between employed and unemployed wives, no policy or action
may follow upon a statistical decision to reject the null hypothesis, which
remains largely in the realm of paper work. Whatever consequences
there were of being wrong may be indefinitely deferred, or in the end




Figure 12.2.2 Sampling Distribution of Difference, Hypothetical
Differences Between Population Means, 0 and A


as the risk of rejecting a true hypothesis decreases, the risk of accepting
a false hypothesis correspondingly increases.* How, then, should we
arbitrate between these two types of risks? 
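
The opposing movement of the two risks can be shown numerically. In the sketch below, which is an illustration rather than a procedure given in the text, the alternative A is assumed to be a true difference of 2.6 with a standard error of 1.3 (both values hypothetical), the test is two-tailed, and the sampling distribution is normal. The function `beta_risk` is an invented name for the probability of accepting a false null hypothesis.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()


def beta_risk(alpha, true_diff, se_diff):
    """Chance of accepting H0 in a two-tailed test when the true difference is true_diff."""
    z_crit = STD_NORMAL.inv_cdf(1 - alpha / 2)
    shift = true_diff / se_diff  # alternative mean in standard-error units
    # Area of the alternative's sampling distribution lying in the acceptance zone.
    return STD_NORMAL.cdf(z_crit - shift) - STD_NORMAL.cdf(-z_crit - shift)


# Hypothetical alternative: true difference 2.6, standard error 1.3.
for alpha in (0.10, 0.05, 0.01, 0.001):
    print(alpha, round(beta_risk(alpha, true_diff=2.6, se_diff=1.3), 3))
```

Each successive line of output pairs a smaller risk of rejecting a true hypothesis with a larger risk of accepting a false one, which is the dilemma posed above.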


Assessment of Personal Risks. No reasonable man will ever decide an
issue only on the basis of statistical probabilities. He will consider the
uncomfortable consequences of being wrong as well as the agreeable
benefits of deciding right. He will weigh these subjective alternatives
as well as the objective quantitative probabilities. At this juncture, the
modern statistician temporarily lays aside his statistics and falls back
upon the pleasure-pain principle (the "felicific calculus") of nineteenth
century utilitarianism.


In the foregoing cases, the teacher might have argued that it would 
be perfectly possible (even though not highly probable) for the student 
to have marked nine out of ten answers correctly by chance while
knowing little or nothing about the subject. Being therefore still unconvinced
about the student's knowledge, he could decide to fail him (acceptance
of the null hypothesis that the observed sample is only a chance
variation). To be sure, the teacher would thereby run the calculated risk of
failing a good student who, in fact, was actually well informed. On the
other hand, in passing the student, he would run the risk of passing a
student who had, in fact, answered the questions successfully by sheer
chance. Either decision will involve some risk of deciding wrong since
the null hypothesis can never be proved or disproved anyway. In the
light of personal or social consequences, which risk would the teacher 


* In statistics, the rejection of a true hypothesis is termed a Type I, or
alpha, error, while the acceptance of a false hypothesis is termed a Type II,
or beta, error.




tion to analyze further the determining factors of the phenomenon under
investigation (in our illustration, to determine whether the concept of
"geographic region" should be probed further). Such a liberal
interpretation of the critical ratios would create an opportunity for the
enrichment of research, while rigid adherence to the stereotyped 1 and 5 per
cent levels would prematurely cut off many possibilities of uncovering
new variables. The implication is that we do not sufficiently exploit
the uses of the null hypothesis when we terminate a study upon its
rejection. The rejection of the hypothesis will provide an opportunity to
continue study, and to investigate possible alternative hypotheses on
the magnitude and nature of the difference which has been tentatively
accepted as true. Unfortunately, many workers in social science do not
avail themselves of the opportunity to take what is, after all, the next
step in scientific inquiry. They are content to decide for or against the
null hypothesis and go no further.

Other Applications of the Null Hypothesis

Comparison of Two Percentages. It is a common belief that women are
more pacifistic than men. In measuring the degree of difference between
sexes, let us suppose a sample survey in which 60 per cent of the men
favor a militant foreign policy, but only 50 per cent of the women favor
such a belligerent position. We display this result in a 2 x 2 table (Table
12.2.2). Is it possible, we may ask, that this difference of 10 percentage
points is a mere sampling error, and that the percentage favoring a
"get-tough" policy is actually the same in both populations? Such a
difference could conceivably arise in the course of random sampling, even
though the two sampled universes were exactly alike. Evidently, we
have here another occasion for a test of the null hypothesis of no
difference. A rejection of the null hypothesis would tend to sustain this
popular belief, whereas acceptance would contradict it.

Table 12.2.2 Foreign Policy Favored, by Sex

The technique of testing for the possible significance of the observed 
difference between two sample percentages is identical in fundamentals 
with that set forth for two sample means. To illustrate this procedure, 



Policy            Male        Female

Militant           60%          50%
Non-militant       40           50

                  100%         100%
                N = 400      N = 400

Source: Hypothetical





even evaded. The decision is more in the nature of an intellectual
adventure than a practical matter.

In any case, it should now be evident that decision-making is
relativistic in character, differing from person to person and from group to
group. An incorrect decision to accept what later turns out to be a
defective carload of wheat would carry much more serious personal and
commercial consequences than would an ascription of personality score
differences to employed and unemployed wives where none actually exist.

The Level of Significance. If the foundation of the above relativistic
arguments is reasonable, it would not seem very sound to adhere
invariably to the .05 and .01 significance levels in all types of problems
and situations, as is now the conventional practice. The conclusion
seems inescapable that individuals, in their informal and unquantified
dilemmas, often use more discernment in making their decisions than
many statistical workers in the social sciences. If the field of statistics
is an extension of common sense, this should not be so. There are, of
course, occasions when high certainty might be deemed defensible. In
the acceptance of a new drug, it would certainly seem humane to await
the attainment of considerable certainty before concluding that the new
drug differed significantly from the old. The over-eager floating of
insufficiently tested medication might produce, and has indeed produced,
deleterious effects on the population. But such is not the general
attitude of the long-range gambler who can absorb non-fatal and
short-term losses. His guiding rule is the old adage: "Nothing ventured,
nothing gained," a low threshold of risk. The more cautious and
conservative decision rule is to "look before you leap," a high
threshold of risk. If we set the significance level too low (if we are overly
cautious), we will exclude many new ideas and experiments; on the
other hand, if we incautiously set it too high, we may wastefully follow
many false leads.

Returning now to the difference between the suicide rates of eastern
and western cities, we are actually interested in probing factors which
cause the variation in rates. In such an exploratory study, it would
be quite reasonable to enlarge the so-called rejection zone by lowering
the critical ratio from 2.58 (1 per cent level) to 1.96 (5 per cent level)
or even still lower. A tentative assumption of a genuine difference
would even be justified at the one-sigma point (32 per cent level). The
consequences of wrongly rejecting the “no-difference” hypothesis in
this case are not very serious anyway; and the impulse to further
investigation, which would have been suffocated by accepting the null
hypothesis, might yield very pronounced benefits.
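
The probabilities attached to these critical ratios can be checked against the normal curve. The following is a minimal sketch in Python, assuming a two-tailed reading of each level; the function name `two_tailed_probability` is ours and does not come from the text.

```python
import math


def two_tailed_probability(critical_ratio: float) -> float:
    """Chance that a normal deviate falls at least this many sigmas
    from the mean, in either direction."""
    # erfc(z / sqrt(2)) equals 2 * (1 - Phi(z)) for the unit normal curve.
    return math.erfc(abs(critical_ratio) / math.sqrt(2.0))


for ratio in (2.58, 1.96, 1.00):
    pr = two_tailed_probability(ratio)
    print(f"critical ratio {ratio:.2f}: probability {pr:.3f}")
```

Under these assumptions the three ratios correspond to roughly the 1, 5, and 32 per cent levels named above.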

It should be recalled that the null hypothesis of no difference is, after
all, merely a skirmish to determine whether there is sufficient justifica-
§ 12.2 Testing a Statistical Hypothesis


the stub of the table at 2.9 sigmas, we find that differences at least as
large as ±2.9 can be expected to occur 2 times in 1,000. This completes
the computational side of the problem, and puts us in a position to judge
the plausibility of the null hypothesis.
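
The four computational steps may be gathered into a short routine. This is only a sketch, assuming the figures of the worked example (60 and 50 per cent, 400 cases in each sample) and a one-tailed probability; the function name and the returned tuple are our own conventions. Because it carries unrounded values, it yields a ratio slightly below the 2.9 obtained with the rounded standard error of 3.5.

```python
import math


def percentage_difference_test(p1, n1, p2, n2):
    """Return (D, pooled p, standard error, critical ratio, one-tailed Pr).

    Percentages are on the 0-100 scale, as in the text.
    """
    d = p1 - p2                                    # step 1: observed difference
    p = (n1 * p1 + n2 * p2) / (n1 + n2)            # weighted average percentage
    q = 100.0 - p
    se = math.sqrt(p * q * (1.0 / n1 + 1.0 / n2))  # step 2: standard error
    z = d / se                                     # step 3: significance ratio
    one_tailed = 0.5 * math.erfc(abs(z) / math.sqrt(2.0))  # step 4: normal tail
    return d, p, se, z, one_tailed


d, p, se, z, pr = percentage_difference_test(60.0, 400, 50.0, 400)
print(f"D = {d:.1f}, p = {p:.1f}, s_D = {se:.2f}, ratio = {z:.2f}, Pr = {pr:.4f}")
```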

In view of this small probability of being wrong, we might routinely
reject the null hypothesis and tentatively accept the alternative
hypothesis that women are more pacifistic than men. But even here, with
only a Pr = .002, we would still have to consider the consequences of
being wrong in that decision. Accordingly, we may imagine an anxious
government spending millions of dollars in a time of crisis to reshape
feminine opinion, on the assumption that women are pacifistic. But if
that verdict is wrong (if women are already as militant as men) then
millions will have been spent unnecessarily. For that reason the
government might be reluctant to embark on an expensive propaganda
campaign unless they were practically certain of the falsity of the null
hypothesis. But insisting on practical certainty of its falsity has a price
of its own; for, by reducing the risk of rejecting what could turn out
to be a true hypothesis, we increase the risk of accepting a false one.
This is the dilemma every decision-maker must somehow resolve. In
this instance, it may appear less serious to reject the hypothesis of no
difference (though true) than to accept it (though false). The
propaganda can presumably do no harm, excepting the financial cost to an
affluent government, and it may do some good.

The whole argument, which may seem needlessly prolix, may be wrapped
up in the generalization that the null hypothesis will always be rejected
whenever there is everything to gain if right, and nothing to lose if wrong.
A person may assume that he has no vitamin deficiency (H0) and still
take vitamin tablets (reject H0) since he has everything to gain if he needs
them and little or nothing to lose if he doesn't need them.

We reiterate that the detailed procedure set forth above for comparing
two sample percentages is applicable only to large samples. Only then
will the theoretical sampling distribution of differences be normal, as
was assumed in setting up the probability of differences larger than the
observed. Moreover, the samples must be extra large insofar as the
sample percentages deviate markedly from 50 per cent. It is therefore
necessary to scrutinize each application in order to ensure that the sample
data meet the conditions which the technique presupposes, a rule which
holds for every other statistical application as well.

Comparison of Three or More Percentages. There will be occasions to
compare percentage distributions of samples from three or more
universes in order to test the hypothesis that the universes do not differ
(a typical null hypothesis). For example, the percentage of cross-class
dating, as observed in samples of high school boys, may differ from one

Statistical Inference

we process the samples of men and women shown in Table 12.2.2,
assuming that each sample has 400 cases.

(1) Calculate the numerical difference between the sample percentages:

\[ D = p_1 - p_2 = 60 - 50 = 10 \]

We conceive of this single observed difference as one of all possible
differences within a sampling distribution which is assumed to be normal
around a mean of zero as required by the null hypothesis.

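(2) Estimate the standard error of the sampling distribution of the
difference according to the formula:

\[ s_D = \sqrt{pq\left(\frac{1}{n_1} + \frac{1}{n_2}\right)} \tag{12.2.4} \]

where

\[ p = \frac{n_1 p_1 + n_2 p_2}{n_1 + n_2} \]

is the weighted average of the two percentages, which estimates the
common P assumed by H0, and q = 100 − p.

Solving:

\[ p = \frac{400(60) + 400(50)}{400 + 400} = 55, \qquad q = 100 - 55 = 45 \]

and:

\[ s_D = \sqrt{(55)(45)\left(\frac{1}{400} + \frac{1}{400}\right)} = 3.5 \]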

(3) Compute the significance ratio by dividing the estimated standard
error into the observed difference, which reveals how many sigma units
our observed difference is distant from the hypothetical zero mean:

\[ z = \frac{D}{s_D} = \frac{10}{3.5} = 2.9 \]

(4) Consult the table of normal probabilities (Table I, Appendix) with
this sigma measure, and fix the probability of obtaining a difference
this large on the hypothesis that the true difference is zero. Entering


class, 27 (54 per cent) state that they engage in cross-class dating; 25
(50 per cent) of the middle class and 23 (46 per cent) of the lower class
report this behavior. It is the differences among the sample frequencies
that set up the problem: do these observed differences represent genuine
differences among the three social classes? or are the populations
probably homogeneous?

To provide an answer to this question, we first arrange the data in a
2 × 3 contingency table (Table 12.2.3b), which serves to display (a) the
number of cases in each sample, (b) the number of cross-class daters
in each sample, (c) the number of cross-class daters in all samples
combined, and (d) the grand total of cases.

Table 12.2.3b  Social Class and Type of Dating, Observed Frequencies (O)

                        Social Class
Dating Type      Upper    Middle    Lower    Total
Cross-Class        27       25        23       75
In-Class           23       25        27       75
Total              50       50        50      150

Second, we prepare a similar chart showing the expected frequency
of each cell (Table 12.2.3c).

Table 12.2.3c  Social Class and Type of Dating, Expected Frequencies (E)

                        Social Class
Dating Type      Upper    Middle    Lower    Total
Cross-Class        25       25        25       75
In-Class           25       25        25       75
Total              50       50        50      150

These chance frequencies are derived according to the simple principle
that percentage distributions within samples shall be identical with the
corresponding marginal distribution. This marginal distribution gives us
the best estimate of the common percentage distribution posited by the
null hypothesis. Since the dis-



social class to another (Table 12.2.3a), but it is still possible that these
observed differences are a result of sampling variation rather than of
social class practice.


Table 12.2.3a  Social Class and Type of Dating, Male High School Students

                        Social Class
Dating Type      Upper    Middle    Lower
Cross-Class       54%      50%       46%
In-Class          46       50        54
Total            100%     100%      100%

Source: Hypothetical


Chi-Square (χ²) as a Test of Significance. When as many as three sample
percentages are being compared, as in this example, a different
technique is used from the one that applies to the case of only two samples.
In order to compare all percentages simultaneously, it is the versatile
chi-square technique that must now be employed. But the principles of
testing remain the same: (1) the null hypothesis is formulated that
social classes are indistinguishable in their dating behavior, that is,
that the universe distributions are homogeneous; (2) a composite index
(χ²) of the magnitude of the observed differences is computed; which
(3) permits us to fix the probability of obtaining larger percentage
differences under the null hypothesis. Whether or not H0 is rejected depends
on the magnitude of that probability: a small probability points to its
rejection, a large one to its acceptance.

The manner in which χ² measures the differences between the observed
frequencies (percentages) and those expected under the null hypothesis
has already been shown in the discussion of the contingency coefficient.
As these differences increase in magnitude, the value of χ² likewise
increases. Hence, the null hypothesis becomes progressively less tenable
and more likely to be dismissed as χ² increases in magnitude. Thus, a
relatively large χ² will tend to discredit the null hypothesis, whereas a
small χ² will tend to uphold it, depending as usual on the calculated
risks.

Computation of χ². Fundamentally, χ² is computed on the actual
frequencies instead of on the percentages, since only in that manner will
sample size be properly weighted. Reverting to Table 12.2.3a, let us
suppose then that we have three samples of 50 boys each. Of the upper


Sixth, and finally, we sum all normed cell deviations to obtain χ².
In symbols:

\[ \chi^2 = \sum \frac{(O - E)^2}{E} = .64 \]

In our example, the sum of the normed deviations is .64. This is the
end-statistic to which a probability (Pr-value) is attached for the
purpose of testing the initial hypothesis of no difference between population
proportions.
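
The squaring, norming, and summing steps can be traced in a few lines of Python. The sketch below assumes the observed frequencies of the dating example (27, 25, 23 cross-class and 23, 25, 27 in-class) and an expected frequency of 25 in every cell; the function name `chi_square` is ours.

```python
observed = [[27, 25, 23],
            [23, 25, 27]]
expected = [[25, 25, 25],
            [25, 25, 25]]


def chi_square(observed, expected):
    """Sum the normed squared deviations (O - E)^2 / E over all cells."""
    total = 0.0
    for o_row, e_row in zip(observed, expected):
        for o, e in zip(o_row, e_row):
            total += (o - e) ** 2 / e
    return total


print(round(chi_square(observed, expected), 2))
```

Four cells contribute 4/25 each and two contribute nothing, which gives the value of .64.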


Fixing the Probability. In assessing the probability of a given χ²-value,
we cannot employ the familiar normal curve. Rather, we must turn to
a different set of probabilities, which vary according to the number of
degrees of freedom. Further, since it would be very impracticable to
provide a full set of probabilities for each df, it is customary to present
only several selected χ² significance values and their corresponding
probabilities for each df up to 30. Beyond that point, the normal
probabilities may be employed because of the convergence of χ² and z. Part
of a conventional table is presented in Table 12.2.4.

Table 12.2.4  Table of χ²-Values, by Selected Probability Values

                              Probability
 df      .70      .50      .30      .10      .05      .02      .01
  1      .15      .46     1.07     2.71     3.84     5.41     6.64
  2      .71     1.39     2.41     4.60     5.99     7.82     9.21
  3     1.42     2.37     3.67     6.25     7.82     9.84    11.34
  4     2.20     3.36     4.88     7.78     9.49    11.67    13.28
  5     3.00     4.35     6.06     9.24    11.07    13.39    15.09
 10     7.27     9.34    11.78    15.99    18.31    21.16    23.21
 15    11.72    14.34    17.32    22.31    25.00    28.26    30.58
 20    16.27    19.34    22.78    28.41    31.41    35.02    37.57
 25    20.87    24.34    28.17    34.38    37.65    41.57    44.31
 30    25.51    29.34    33.53    40.26    43.77    47.96    50.89

The table may be used to evaluate the significance of our computed
χ²-value of .64, 2 df. Examining the row entries alongside 2 df in the
stub, we find that .64 is not significant at the 1 per cent level, nor is it
significant at the 5 per cent level, or even at the 50 per cent level. In
fact, better than 70 times in 100 we would obtain χ²-values larger than
.64 when the null hypothesis is true. In view of this result, we would in
all likelihood hold



tribution of row totals is 50-50 per cent, we simply apply this ratio
to each column sample of 50 items to obtain the expected frequency of
25 in each cell.
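
The same expected frequencies follow mechanically from the marginal totals. The sketch below assumes the observed table of the dating example; the helper name `expected_frequencies` is hypothetical, and the rule it applies (column total times the row's share of the grand total) is simply the 50-50 ratio stated in general form.

```python
def expected_frequencies(observed):
    """Expected cell counts when every column shares the marginal row distribution."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


observed = [[27, 25, 23],   # cross-class daters: upper, middle, lower
            [23, 25, 27]]   # in-class daters
for row in expected_frequencies(observed):
    print(row)
```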

Degrees of Freedom (df). The χ² method requires that the marginal totals
of the expected frequencies be identical with those of the observed cell
frequencies; hence, in our problem it is possible to calculate freely only
two of the six expected cell frequencies. After any two have been
independently fixed, the other four are predetermined by the marginal totals
and are therefore not free to vary. We therefore state that this table
possesses “two degrees of freedom.” In general, a “rows × columns”
contingency table will have (r − 1)(c − 1) degrees of freedom. This
calculation is made necessary by the fact that the significance of the
sample χ² is dependent in part on its degrees of freedom.

Continuing with the computation of χ², our third step is to subtract
each expected frequency from its companion observed frequency (Table
12.2.3d). Any row (or column) of these deviations must sum to zero,
a consequence of the aforementioned constraint that observed and
expected marginal totals be equal.

Table 12.2.3d  Deviations of Observed from Expected Frequencies (O − E)

                        Social Class
Dating Type      Upper    Middle    Lower    Total
Cross-Class         2        0        −2        0
In-Class           −2        0         2        0
Total               0        0         0        0

Fourth, we square the deviations, (O − E)², and thereby eliminate
negative signs:

        4    0    4
        4    0    4

Fifth, we divide each squared deviation by its expected frequency,
thereby norming it on its base:

      .16    0  .16
      .16    0  .16





The E-values are, of course, manipulated as in the previous
demonstration, except that we here illustratively use the conventional compact
worksheet (Table 12.2.6).

Table 12.2.6  Chi-Square Worksheet

Cell No.      O      E    (O − E)   (O − E)²   (O − E)²/E
   11        20     10       10       100         10.0
   12         5     10       −5        25          2.5
   13         5     10       −5        25          2.5
   21         5     10       −5        25          2.5
   22        20     10       10       100         10.0
   23         5     10       −5        25          2.5
   31         5     10       −5        25          2.5
   32         5     10       −5        25          2.5
   33        20     10       10       100         10.0
                              0               χ² = 45.0

With the obtained χ² = 45, we consult the chi-square table to determine
the probability of a value at least that large under H0. Scanning the
entries alongside 4 [df = (r − 1)(c − 1)] in the stub, we discover that
45 is significant far beyond the 1 per cent level, since values even as
large as 13.3 would occur only 1 time in 100. To determine how much
less probable 45 is, it would be necessary to consult a table much more
extensive than here available. We interpret this to mean that religion
and occupation are not independent, but rather are linked together,
since the observed pattern of joint frequencies is so unlikely under H0.
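
Where a more extensive table is not at hand, the probability can be computed directly. The sketch below is offered under stated assumptions: it rebuilds χ² from the occupation and religion frequencies, and then uses a closed-form expression for the upper tail that holds only when the degrees of freedom are even. Both function names are ours.

```python
import math

observed = [[20, 5, 5],
            [5, 20, 5],
            [5, 5, 20]]


def chi_square_independence(table):
    """Return (chi-square, df), with each E = row total * column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / grand_total
            chi2 += (o - e) ** 2 / e
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


def upper_tail_even_df(chi2, df):
    """P(chi-square > chi2) for even df: exp(-x/2) * sum of (x/2)^k / k!, k < df/2."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form applies only to even df")
    half = chi2 / 2.0
    term, total = 1.0, 1.0
    for k in range(1, df // 2):
        term *= half / k
        total += term
    return math.exp(-half) * total


chi2, df = chi_square_independence(observed)
print(chi2, df, upper_tail_even_df(chi2, df))
```

On these assumptions the probability attached to 45 with 4 df is of the order of a few in a billion, while the same routine returns about .01 for the tabled value of 13.28.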

It should be repeated that the value, χ², is not a measure of the degree
of association; its significance level only testifies to the probability of
its presence. For a measure of the degree of correlation, a different
technique would necessarily have to be resorted to. The reason for this
is that raw chi-square is a variable quantity and not normed so as to
yield an index ranging between the conventional limits of 0 and 1.

As with so many techniques employed in statistical testing, the
chi-square method is applicable only under the circumstance that samples
were randomly and independently selected. In addition, the chi-square
table is valid only when each expected frequency is not small (minimum
10). When these conditions are impossible of fulfillment, either the
data must be adapted or alternative procedures must be employed.



to H0; otherwise we would be confronted with a greater than 70 per cent
risk of rejecting a true proposition, and this is obviously too great a risk
to incur.
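
For the special case of 2 degrees of freedom the risk can be stated exactly, since the upper-tail probability of χ² then reduces to exp(−χ²/2). The sketch below relies on that special case only; the function name is ours, and the value of .64 is the sum of the normed deviations in the dating example.

```python
import math


def chi_square_tail_2df(chi_square: float) -> float:
    """P(chi-square value larger than the one given) when df = 2."""
    return math.exp(-chi_square / 2.0)


print(round(chi_square_tail_2df(0.64), 3))   # risk of rejecting a true H0
print(round(chi_square_tail_2df(5.99), 3))   # conventional 5 per cent point for 2 df
```

A χ² of .64 thus carries a probability of about .73, in keeping with the statement that the risk exceeds 70 per cent.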

Chi-Square as a Test of Independence. Instead of focusing on sample
differences as reflecting universe differences, it may be more useful to
view the contingency table as reflecting the degree of association between
the variables shown in the table. Thus, we may center our attention
on the possible association between religion and occupation (Table 12.2.5)
and employ the chi-square test to establish the probability of any such
association.

Table 12.2.5  Occupation and Religious Affiliation

                         Occupation
Religious
Affiliation      Banker   Merchant   Clerk   Total
                   20         5        5       30
                    5        20        5       30
                    5         5       20       30
Total              30        30       30       90

Source: Hypothetical

In this instance, we again open with the null hypothesis that the two
variables are statistically independent of each other. If we reject this
hypothesis of independence, we tacitly accept the alternative that
association is present, although the value of χ² does not supply us with
a normed measure of the degree of that association.

The calculation of χ² for this problem is identical with that previously
presented, except that we shall here employ a short-cut technique for
computing the expected frequencies. To find the expected frequency
of any given cell, we simply multiply its respective marginal totals, and
divide this product by the grand total. Applying this rule to Table
12.2.5, we find the expected frequency of the first cell, first row, to be

\[ E_{11} = \frac{30 \times 30}{90} = 10 \]

where the subscript “11” designates the intersection of the first row
and first column. Once any four of the nine E-values have been freely
computed in this manner, the remaining five may be calculated as
residuals from the marginal totals which act as constraints.



mean of the sampling distribution is by hypothesis zero, we need only
divide the observed r by the standard error:

\[ z = \frac{r - 0}{1/\sqrt{n - 1}} \]

If a given z-measure should happen to be 2.58, the probability of larger
r's on the assumption of the null hypothesis would be 1 in 100, and we
could reject it at the 1 per cent significance level; if the z-measure is
1.96, the probability of larger r's would be 5 in 100 and we could reject
it at the 5 per cent significance level. In practice, we do not need to
carry out calculations of the above type; rather we consult a readily
available table of critical values of r at varying levels of significance.
An abbreviated table of that type is presented here by way of illustration.
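
For readers who wish to see the calculation that such a table replaces, a minimal sketch follows. It assumes the large-sample standard error of 1/√(n − 1) under the null hypothesis; the function names and the sample values (an r of .09 in 500 cases, and a sample of 50 cases) are chosen by us for illustration, and published tables may rest on a more exact method.

```python
import math


def z_for_r(r: float, n: int) -> float:
    """Sigma measure of a sample r when the population correlation is zero."""
    return r * math.sqrt(n - 1)


def critical_r(critical_ratio: float, n: int) -> float:
    """Smallest r that reaches the given critical ratio in a sample of n cases."""
    return critical_ratio / math.sqrt(n - 1)


print(round(z_for_r(0.09, 500), 2))     # compare with 1.96 and 2.58
print(round(critical_r(1.96, 50), 2))   # minimum r at the 5 per cent level, n = 50
```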
Table 12.2.7  Minimum Values of r, Selected Significance Levels and
              Sample Sizes

Sample Size          Level of Significance
    (n)            5%        2%        1%
     50           .28       .33       .36
     60           .25       .30       .33
     70           .23       .28       .31
     80           .22       .26       .29
     90           .21       .24       .27
    100           .20       .23       .26
    200           .14       .16       .18
    300           .11       .13       .15
    400           .10       .12       .13
    500           .09       .10       .12

From Table 12.2.7, we see that an obtained r of .09 based on a sample
of 500 cases would be significant at the 5 per cent level but not at the
1 per cent level. However, for a sample of only 50 cases, the obtained
r must reach a value of .28 in order to be considered significant at the
5 per cent level. Thus, as sample size increases, we may reject the null
hypothesis of zero correlation at a fixed significance level with a pro-




Null Hypothesis: Pearsonian r. The value of Pearsonian r is subject
to sampling variation as is any other sample statistic. Thus, a first
sample of observations on marriage adjustment and age difference may
yield an r of .25, while a second yields an r of −.25. In fact, in a long
succession of sample r's from this bivariate population, the correlation
may now be plus, now minus, with neither sign displaying a tendency
to dominate. Such a vacillation between signs would strongly suggest
that the correlation in the sampled universe might be zero, and that
each sample r was merely a result of sampling error ascribable to the
chance factors operative in random sampling.

As a rule, therefore, before rendering an elaborate interpretation of
an obtained sample r, we should reckon with the possibility that there
is really no linear relation at all. True, the exploration of social
interaction would never be initiated for the express purpose of testing the null
hypothesis of zero correlation, that is, to demonstrate the essential
unrelatedness of things. Quite the contrary, the social analyst is moved
by an interest in establishing functional relationships. Nevertheless,
because of the tricks played by chance sampling, it is sound policy to
investigate the assumption of independence between the two variables.

The testing for independence between two quantitative variables is
in principle no different from that of qualitative variables, as set forth
in the foregoing section: we allege the truth of the null hypothesis and
then test that assumption by the obtained probability, or Pr-value. In
this instance, we once again employ the normal probabilities, since the
sampling distribution of r is approximately normal around the
population correlation, with standard error expressed:

\[ \sigma_r = \frac{1 - \rho^2}{\sqrt{n - 1}} \tag{12.2.5} \]

provided that n is large, and the population correlation is not greater
than about ±.8. It follows that, when the population correlation is
zero (as maintained by the null hypothesis) the sampling distribution
will have a mean of zero and a standard error:

\[ \sigma_r = \frac{1}{\sqrt{n - 1}} \tag{12.2.6} \]

which is, of course, the relevant case for our purposes.

To test, then, the null hypothesis of no correlation, we conceive of
the observed sample r as belonging to a normal distribution having a
mean of zero and a standard error of 1/√(n − 1). To fix the probability
of an r at least as large as that obtained, we transform the sample r to
a sigma measure and find the corresponding probability. Since the

control and game theory, which has tended to sharpen the edges of the
idea and to inject a certain procedural clarification which it had not
previously possessed. Then, too, with more extended dissemination of
statistical tools, the technique has been gaining in general popularity.
[...] in the hands of a mass of new workers [...] of social studies, many
of whom may not have absorbed [...] of [...] decision-making process,
has generated the type of ritualism which is here deplored.

The Meaning of Hypothesis. In order to appreciate even more fully
the meaning and purport of the null hypothesis, it will be helpful to
[...] concept of hypothesis as employed by statisticians, and [...]
sciences. Fundamentally, any hypothesis is a conceptual tool in the
search for “truth.” A tentative solution to a problem is provisionally
set up, and “nature” is asked to confirm or [...]. If subsequent
observation is congruent with the hypothesis, it is accepted; if not, it is
rejected, and another is set up until one is reasonably content with the
fruitfulness of the inquiry. [...] of hypotheses is distinguishable from
that of [...] sciences. In the latter area, the hypothesis is an ex[...]
in research. Hypotheses of established prestige come readily [...]: the
nebular hypothesis, the germ theory of disease, [...] theories, the
geocentric theory of the universe, the Malthusian theory of population
growth, Ricardo’s iron law of wages, Comte’s law of the three stages,
Freud’s Oedipus theory, Marx’s economic determinism. [...] launched
with considerable confidence that they would [...]. They were rational,
plausible explanations of events, [...] which the past was rendered
credible and the future pre[...] to be “true.”

A statistical hypothesis, on the other hand, is not primarily set up for
explanatory purposes, although an explanation may be implied.
Fundamentally, a statistical hypothesis is a description of a population
characteristic, the plausibility of which is checked against the evidence in
[...] value [...] statistically testable hypothesis of a parameter [...]
hypothesis. We (1) set up a hypothesis of the parameter value, and
(2) determine whether we could reasonably expect the observed value
to have been obtained by chance sampling variation on the assumption
of the proclaimed hypothesis.

As [...] preceding presentations, the most prevalent use of the null
hypothesis, however, has arisen in what we may call the experi[...],
and it is from this context that its predominant con[...] current usage
has emerged. We have here another instance of a [...] being
appropriated by a specific subcategory. The hypothesis, as now usually
employed, may take on several related formu-




the concepts “spurious,” “concealed factor,” “selective factor,” and
the like beset the social analyst whenever he attempts to translate an
inanimate numerical statement into a living guide to action. In our
problem of suicide, it will not be obvious on the surface, for example,
whether it is “eastness” and “westness” that determine the observed
difference between suicide rates, or whether there are unidentified and
concealed factors that are doing the work falsely credited to “regional”
factors. Similarly, we cannot be sure that it is habitual smoking that
determines the observed difference in the incidence of lung cancer
between smokers and non-smokers. And even in experimentation, we may
spuriously credit the experimental factor with work being performed by
unknown, hidden variables.

Students who have not firmly gripped the fundamentals of
decision-making are not likely to appreciate its probabilistic and inconclusive
nature. For example, “no proof of a difference” may be mistaken for
“proof of no difference.” Similarly, the null hypothesis may be
“accepted” [...] merely “not rejected.” It cannot be too sternly
emphasized that the null hypothesis can never be proved, and in this strict
sense can never be accepted. A more precise, though more cumbersome,
[...] would be: “not to reject the null hypothesis at a given level [...]

There is something to be said for the point of view that, a priori, it
is extremely unlikely that there should be any absolute identity between
the parameters of populations in the first place, no matter how strong
the evidence is for no difference. In practice, the conception of no
difference is probably informally and generally interpreted as “no appreciable
difference.” In fact, any observed difference is more consistent with an
[...] hypothesis of a specific difference than with the null hypothesis
of no difference. After all, the observed difference is the best
[...] of the true difference. This being the case, we should give as
much consideration to an alternative hypothesis of some specific
difference as to the null hypothesis of no difference.


QUESTIONS AND PROBLEMS

1. Define the following concepts:

Hypothesis Testing
Statistical Hypothesis
Null Hypothesis
Decision-Making
Significance (Critical) Ratio
Sampling Distribution of the Difference
Chi-Square
Test of Independence
Test of Homogeneity



lations: “no difference,” “zero correlation,” “chance outcome,”
“random variation.” In an experiment we seek to determine whether an
experimental group, having been given a treatment, differs from the
control group, thereby establishing the probable effect of the
experimental factor. In order to proceed in the most parsimonious manner,
it is provisionally assumed that there is no real difference at all, and
that any observed difference is only a chance variation around a zero
difference. If a difference should be established with a high enough
probability, we should ideally turn our attention to an alternative
hypothesis about the possible magnitude of that difference. Unless we know
the approximate magnitude of the difference, we are often not much
better off for purposes of subsequent action than we were before we
accepted the mere presence of some difference. Accordingly, it has been
suggested that a confidence interval for an observed difference will often
be more useful than the critical ratio which tests the null hypothesis.
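
One way such an interval might be computed for a difference between two sample percentages is sketched below. The figures of 60 and 50 per cent in samples of 400 each are illustrative values chosen by us; the unpooled standard error and the multiplier of 1.96 (a 95 per cent interval) are likewise our assumptions, since no formula for the interval is given here.

```python
import math


def difference_interval(p1, n1, p2, n2, multiplier=1.96):
    """Confidence limits for p1 - p2, with percentages on the 0-100 scale."""
    d = p1 - p2
    # Unpooled standard error: each sample contributes its own p(100 - p)/n.
    se = math.sqrt(p1 * (100.0 - p1) / n1 + p2 * (100.0 - p2) / n2)
    return d - multiplier * se, d + multiplier * se


low, high = difference_interval(60.0, 400, 50.0, 400)
print(f"difference of 10 points; interval from {low:.2f} to {high:.2f}")
```

An interval of this kind excludes zero, as the critical ratio would also indicate, but in addition it conveys how large or small the true difference may plausibly be.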

The “Social Experiment.” But the concept of “treatment” as employed
in biological studies, for example, must be liberally interpreted in
sociological investigations. Although experimental procedures are by no
means unknown, they are not widely used, since the human animal is
not so readily accessible to experimentation. Even though experimental
methods may be considered by some methodologists as the only valid
ones, we certainly cannot insist on such methodological purity in the
present state of sociological science. For some time to come we will
have to rely on “Nature” to produce crimes, divorces, births, and deaths
under diverse and uncontrolled conditions which can be standardized
and held constant only with some difficulty. In practice, we therefore
frequently make survey studies, or observations of “nature in the raw,”
with no factors under laboratory control. Nevertheless, we still find it
profitable to apply the null hypothesis in such contexts, sometimes
disparagingly termed ex post facto investigations, as well as under conditions
that are more favorable to randomization, as required by the classical
experiment. There is some disagreement among students of
methodology concerning the relative values of such procedures.*

Interpretive Issues in Decision-Making. The decisions consequent upon
the null hypothesis are encumbered with dilemmas very similar to those
that plague the interpretation of a correlation. Nature is very complex,
and any formula will accommodate only a limited number of variables,
while at the same time responding to innumerable hidden factors. Hence,
* Cf. Hanan C. Selvin, “A Critique of Tests of Significance in Survey
Research,” American Sociological Review, XXII, 1957, pp. 519-527; Robert
McGinnis, “Randomization and Inference in Sociological Research,” American
Sociological Review, XXIII, 1958, pp. 408-414; Leslie Kish, “Some Statistical
Problems in Research Design,” American Sociological Review, XXIV, 1959,
pp. 328-338.


(a) Compute χ².

(b) Is the difference significant at the .05 level? at the .02 level?

(c) What substantive conclusion would you draw?

(d) Is this borne out by direct analysis of the table?






2. Draw two samples of 30 cases each from the list of 107 suicide rates (Table
3.1.1a). Compute sample means and proceed to test the null hypothesis of no
difference. Would it be possible to reject the null hypothesis, even though both
samples were drawn from the same population?

3. Of 225 college men in a sample survey, 45 per cent favor the abolition of
fraternities; of 100 college women, 40 per cent favor such a policy. Test the null
hypothesis that men and women do not differ in their attitudes on this issue.

4. A candidate claimed that 60 per cent of the electorate would vote for him.
In a sample of 1,000 registered voters, 55 per cent declared for that candidate.
By means of chi-square, test the credibility of the candidate’s claim.

5. An investigator compared questionnaire and interview responses to a given
statement and obtained the results shown in Table 12.2.8. Use the chi-square
test to determine whether the difference between responses is significant at the
5 per cent level.

Table 12.2.8  Comparison of Questionnaire and Interview Results, Subjects

Response Category          Interview    Questionnaire
1. Strongly agree              37             25
2. Agree                       17             27
3. Indifferent                 14             10
4. Disagree                     1              7
5. Strongly disagree            0              0
Total                          69             69

6. Table 12.2.9 represents an attempt to measure the impact of the Kinsey findings
on tolerance toward sexual practices.


Impact of Kinsey Kindinys, Sex ToUrance 
Table 12.2.9 Before ond After Exposure, 1S5S 


Gnopvs 

PoJWT CsAives ijr Djsecnow 
or Toubakcb 

Total 

0 1 2 3 & over 

Experimental (with 
Kinsey data) 
Control (without 
Kinsey data) 

32 27 21 24 

52 n r( IS j 

104 

105 

210 






Appendix


Table I      Areas of the Normal Curve
Table II     Ordinates of the Normal Curve
Table III    Values of χ²
Table IV     Five Thousand Random Digits
Table V      Squares and Square Roots of the Numbers from 1 to 1,000



Table II  Ordinates of the Normal Curve


Table III  Values of χ² (Continued)

of a single tail of the normal curve.

Source: Reprinted from Table IV of R. A. Fisher and F. Yates, Statistical Tables for
Biological, Agricultural, and Medical Research, published by Oliver & Boyd Ltd.,
Edinburgh, and by permission of the authors and publishers.


Table IV  Five Thousand Random Digits (Continued)


50-54 55-59 60-64 65-69 70-74 75-79 80-84 85-89 90-94 95-99


32547 

16916 

66176 

46299 

22547 

41551 

25444 

47520 

S49T8 

374CK 


312S3 

00041 

5KM7 

13335 

47839 

54160 

59497 

62378 

63271 

89416 


80593 

55023 

27137 

16561 


03345 
30236 
21005 
12180 
45355 

03320 69936 
91556 95917 
95555 83174 
13143 82651 
69035 929S0 


69214 

14253 

03191 

35943 

47520 

34S03 

65553 

130S5 

05271 

49456 


703S1 

76583 

4397D 

59292 

54093 

02479 

25639 

16561 

05322 

74375 


7S2SS 20054 91018 
12092 66533 93426 
64625 22394 39622 
62675 63631 37020 
456S3 55549 51575 

33399 71160 64777 
06455 34174 11130 
63559 26679 06233 
06490 44951 49307 
75610 74976 70056 


32400 654S2 52099 53678 74619 91143 65095 695W 

89262 86333 51718 70663 11623 29534 79CT 7^ 

86S66 09127 9S021 03371 27789 SS444 4^ 3^ 

90814 14S33 05759 74645 05016 94056 990W 6^1 

19192 82756 20553 55446 55376 6S914 75096 26119 

77585 52593 56613 95766 10019 29531 730M 2^ 

23757 16364 05096 03192 62336 4S3S9 8^ 

459S9 96257 23850 26216 23309 21526 074» 5^ 

92970 91243 07316 41467 64337 52406 25^ 5^ 

74346 59596 40088 9S176 17896 86900 20249 77753 

87616 41309 27636 
S0099 71038 45146 

10127 46900 64954 

67995 81977 18954 

76304 80317 54934 

51904 41070 56642 

59537 34662 79631 

51228 10937 63396 

31089 37995 29Sn 

3S207 97933 93159 

88666 31142 00174 89712 63153 62333 

53365 56134 67582 92557 89520 ^52 

89907 74530 3S061 90102 I 1693 90257 

186S2 81038 85662 90915 91631 2229 

63571 32579 63042 25371 0923* 94592 

68927 56492 67799 95398 77642 54913 

56401 631S6 393S9 88793 31356 89235 

24333 95603 02339 72042 46287 95333 

17025 84202 93199 62272 06366 16175 

08233 52133 20224 6S034 50S65 


14621 40130 
28064 75999 
42661 91332 
05299 77511 
04932 30721 


62771 

84SS6 

40672 


16742 

37655 

790S5 

7S195 

&16S9 


91904 

51254 

62717 

15478 

71551 

03591 

30180 

73040 

43SI6 




44103 

54S09 

99581 

05041 

45447 




Table IV Five Thousand Random Digits 


59391 58030 52093 82718 87024 

99567 76364 77201 (M615 27062 

10363 97518 51400 25670 9S342 

86859 19558 64432 16706 99612 

11258 24591 36863 55368 31721 


05 95068 8S628 35911 14530 33020 

06 54463 47237 73800 91017 36239 

07 16874 62677 S74I2 I321S 31339 

OS 92491 63157 76593 91316 035l» 

09 15669 S66S9 35GS2 40844 53256 


82848 W190 96574 90464 29065 

96621 43918 01896 83991 51141 

61891 27101 37855 06235 33316 

59798 32803 67708 15297 28612 

94335 34936 02566 80972 08188 

8042S 89936 31S55 34334 64S65 

71824 83671 39892 60518 37092 

62233 80827 73917 82S02 84420 

723S9 96363 62887 01037 66091 

81872 35213 09840 34471 74441 


99116 75486 849S9 23176 62967 67104 39495 39100 17217 74073 

15696 10703 65178 90637 63110 17622 S39S8 71087 84U8 11670 

97720 15369 51269 69620 033SS 13699 33423 67453 43269 56720 

11666 13811 71631 98000 35979 39719 81899 07449 47985 46967 

71628 73130 78783 75691 41632 09S17 61547 18707 85489 69944 

40501 51039 99943 91843 41995 88^1 73631 69361 05375 15417 

22518 55576 9321S 82068 10798 86211 36581 67466 69373 40051 

75112 30485 62173 02132 14878 92379 22231 16783 86352 00077 

80327 02671 98191 84342 00313 49263 95141 15496 2016$ 09271 

60251 45548 02146 05597 48223 81366 34593 72856 66762 17002 

57430 82270 10121 05540 43648 7SSS3 66049 21611 47676 83444 

73528 89559 34454 88596 54076 71693 43132 14414 79949 85193 

25991 65959 70769 64721 86113 35175 42740 06175 82753 66248 

78388 16638 00134 59880 63S06 48472 39318 35454 24057 74739 

12477 09965 96657 57991 59439 76330 24596 77515 09577 91871 

83266 32SS3 42451 15579 38155 29793 40914 65990 16255 17777 

76070 80S76 10237 S9S15 79152 74793 39357 09054 73579 92359 

37074 6S19S 44785 68624 9S336 84491 97610 78735 46703 98265 

83712 06514 50101 78295 54656 85417 43189 60048 72781 72606 

202S7 56862 69727 94443 61936 08366 27227 05153 50326 59566 

74261 32592 86538 27041 65172 85532 07571 80609 39285 65540 

640S1 49863 03478 96001 I8SS3 14810 70515 89755 59064 07210 

05617 75S18 47750 67314 29573 10526 66192 44464 27053 40167 

26793 74951 93466 74307 13330 42664 85515 20632 05497 33625 

659SS 72850 4S737 54719 52056 01596 03815 35067 03134 70322 

27366 42271 44300 73399 21105 03230 73457 43093 05192 4S657 

56760 10909 98147 34736 33863 95256 12731 66598 50771 83665 

72380 43338 93643 58901 59513 23813 11231 83263 65938 81581 

77888 38100 03062 58103 47%l 83841 25S78 23746 55903 44115 

28110 07819 21580 51459 47971 29832 13990 29226 23608 15873 


63525 91141 77033 12147 

47606 93410 16359 89033 

52609 45030 96279 14709 

I673S 60159 07425 62369 

5934S 11695 45751 15865 

12900 71775 29845 60774 

76086 23537 49939 33593 

09495 51434 29181 09993 

20075 3I67I 453S6 36583 

13636 93596 23377 51133 


> 5S312 76923 96071 05313 

61498 31776 05383 39902 

02735 50S03 72744 88203 

37875 71153 21315 00132 

326SS 20271 65128 I4S5I 

3SG36 33717 67598 82521 

2S617 17979 70719 35234 

68922 53125 91077 40197 

52022 41330 60651 91321 

42474 45141 46660 4233S 


Source: Abridged from Table 1.5.1 in G. W. Snedecor, Statistical Methods, 5th edition, 
Iowa State University Press, Ames, Iowa, 1956, by permission of the publisher. 




Table V  Squares and Square Roots of the Numbers from 1 to 1,000 













Indexes 


Index of Names 

Ackoff, R. L., 359 
Anderson, O. N., 10 

Berelson, B., 66, 261 
Bernoulli, D., 7 
Bogue, D. J., 43 
Boole, G., 236 
Braithwaite, R. B., 415 
Buckland, W. R., 10 
Buckle, H. T., 5, 338 
Buell, P., 414 
Burgess, E. W., 64, 269, 345 

Cahnman, W. J., 355 
Cantril, H., 61 
Centers, R., 55 
Clark, R. C., 278 
Cochran, W. G., 359, 378, 379 
Cohen, M. R., 10, 30 
Cottrell, L. S., 64, 269, 345 
Cowden, D. J., 109 
Croxton, F. E., 109 

Dantzig, T., 30 
Davies, G. R., 279, 317 
Davis, B., 261 
Davis, K., 108, 250 
Davis, K. B., M 
Driver, H., 251 
Duncan, O. D., 261 
Dunham, W., 94 
Durost, W. N., 74 


Einstein, A., 208 
Ezekiel, M., 336 

Fugle, D., 282 
Faris, R. E., 94 
Farnsworth, P. R., 280 
Feller, W., 237 
Fermat, P. de, 7 
Festinger, L., 359 
Fisher, R. A., 7, 410, 415 
Fox, K. A., 336 
Freeman, R., 58 

Galton, F., 7, 304, 327 
Gaudet, H., 66, 261 
Gauss, K. F., 7 
Ginsberg, M., 63 
Goodman, L. A., 249, 269 
Graunt, J., 7 
Guttman, L., 269 

Halley, E., 7 
Hankins, F. H., 5 
Hatch, D., 355 
Hatch, M., 355 
Hatt, P. K., 203 
Havemann, E., 359 
Hobhouse, L. T., 63 
Hollingshead, A. B., 60, 268 
Huff, D., 10 
Hume, D., 320 
Hyman, H., 336 





Index of Subjects 


Addition theorem, 217-218 
Array, 35 
Association: 57 ff, 238 ff 
  of attributes, 240 
  coefficient of, 242-244 
  complete, 253 
  definition of, 249-247 
  partial, 200 
  perfect, 253 
Attribute, 12-13 
Average deviation, 159 
Average intercorrelation of ranks, 275-276 
Averages: 112-114 
  characteristics of, 145-146 
  choice of, 143-149 
  comparability of, 146-149 
  mean, 129-142 
  median, 121-129 
  mode, 115-120 

Bias: 22 
  sampling, 347 
Binomial distribution, 231-234 

Case study, 315 
Centile, 90, 127 
Central tendency, 112; see also Average 
Chance: 206-208 
  definition of, 207 
  error, 22 
  frequency, 246-247, 263 
  laws of, 209 
Change, measurement of, 69-71, 102, 107 
Chart: 
  bar, 92-93 
  base map, 94-95 
  semi-logarithmic, 102-107 
  time, 98-101 
  (See also Graph) 
Chi-square: 262-264 
  computation of, 263, 402-405 
  probability, 405 
  table of, 405 
  test of independence, 406 
  test of significance, 403 
  validity of, 407 
Class interval: 36-38 
  midpoint of, 28, 41 
  unequal, 78-81 
Class limits: 
  rounded, 39-41 
  true, 28-29, 39-40 
Coding, 133-135 
Coefficient of variation, 159-161, 166-167 
Combinations and permutations, 223-227 
Concealed factor, 189, 329, 412-413 
Confidence interval: 367, 372 
  of mean, 370-372 
  of percentage, 377-379 
Contingency: 
  principle of, 233 
  table, 403 



Index of Names (Continued) 


Infeld, L., 203 
Jenkinson, B. L., 74 

Katz, D., 359 
Kemeny, J. G., 74, 237 
Kendall, M. G., 10, 74, 150, 180, 253, 269, 330 
Kinsey, A., 344, 359, 414 
Kirkpatrick, C., 414 
Kish, L., 359, 412 
Kline, M., 10 
Kruskal, W. H., 249, 269 

Landis, P. H., 203 
Laplace, P. S. de, 237 
Lazarsfeld, P. F., 30, 66, 203, 261 
Levinson, H. C., 237 
Lew, E. A., 103 
Lowenstein, D., 109 
Lundberg, G. A., 10 
Lynd, H. M., 345 
Lynd, R. S., 343 

Mahalanobis, P. C., 10 
McCarthy, P., 150, 180, 360 
McGinnis, R., 412 
McKay, H. D., 46 
Merton, R. K., 203 
Modley, R., 109 
Moivre, A., 7 
Moroney, M. J., 150, 180 
Mosteller, F., 359 
Mueller, J. H., 162, 186 
Mueller, K. H., 186 
Murdock, G. P., 260 

Nagel, E., 237 
Neyman, J., 237, 415 

Parten, M. D., 360 
Pascal, B., 7 
Pearson, K., 7, 171, 267, 307, 329 
Pemberton, H. E., 72 
Peters, C. C., 276 
Polya, G., 237 

Quetelet, A., 4, 5, 7, 169, 338 


Reinhardt, J. M., 279, 317 
Roberts, H. V., 10, 415 
Robinson, W. S., 336 

Savage, L. J., 237, 415 
Schmid, C. F., 109 
Selvin, H. C., 412 
Shaw, C. R., 46, 293 
Silberman, L., 203 
Simmons, L. W., 260 
Simon, H. A., 336 
Snedecor, G., 336 
Snell, J. L., 74, 237 
Spear, M. E., 109 
Spiegelman, M., 103 
Stephan, F. F., 360 
Stouffer, S. A., 64, 250, 251, 258 
Stryker, S., 414 
Sumner, W. G., 205 
Süssmilch, J. P., 211 

Thompson, G. L., 74, 237 
Thompson, S., 160 
Toby, J., 259 
Torgerson, W. S., 30 
Tukey, J. W., 359 

Van Voorhis, W. R., 276 
Venn, J., 5, 237 

Walker, H., 10, 30, 74 
Wallis, W. A., 10, 415 
Warner, W. L., 296 
West, P. S., 359 
Wheeler, C. C., 63 
Whelpton, P. K., 58 
Willcox, W. F., 10 
Woodbury, R. M., 10 

Yates, F., 360 
Yule, G. U., 150, 180, 244, 253, 269, 325, 329 

Zeisel, H., 52, 74, 203 
Zelditch, M., Jr., 269 
Zeleny, C., 215 
Znaniecki, F., 345 





Index of Subjects (Continued) 


Joint frequency table, 58, 62, 240, 2S7, 290 

Law of large numbers, 212 
Least squares, principle of, 162-163, 301 
Literary poll, 347, 3S5 
Location, measures of: 110-115 
  maximum, 112 
  minimum, 112 

Marginal distribution, 61, 245, 254-257, 290-291 
Mean: 129-142 
  of binomial distribution, 231 
  characteristics of, 146 
  guessed, 135 
  weighting of, 135-141 
Measurement: 11 ff 
  behavioral equivalents, 16 
  enumeration, 16 
  error in, 19-23 
  ranging, 17-18 
  standard unit, 16 
Median: 120-126 
  characteristics of, 146 
  of discrete data, 125-126 
Mode: 115-120 
  bimodality, 115-119 
  characteristics of, 145 
  degree of modality, 120 
Multiplication theorem, 

Normal frequency distribution: 169, 233 
  characteristics of, 170-171 
  table of, 171 
Norming: 151 ff 
  index, 185-187 
  percentage, 182 
  rate, 183 
  ratio, 182 
  standardization, 192-194 
  subclassification, 189-192, 199, 330 
Null hypothesis: 235, 388 ff, 393, 408-409 
  applications of, 399 ff 
  current issues, 410 
  testing of, 390 

Observed frequency: see Expected frequency 
0 ^, 66-68 
Outcome: see Event 


Parameter: 337 
  estimation of, 361 ff 
Permutations and combinations, 223-226 
Phi-coefficient: 252 ff 
  maximum, 255-253, 265 
Population: see Universe 
Probability: 204-213 
  a priori, 210-211 
  binomial, 228, 229 ff 
  calculation of, 214-221 
  empirical, 211, 232 
  possibility set, 209 
  single event, 220-221 

Q-coefficient, 242-249 
Quantiles: 127, 155-156 
  centile, 90, 127 
  quartile, 89, 127 

Range: 152 
  intermediate, 154-156 
  measurement of, 153 ff 
  total, 153 
Rank-order correlation, 271-272, 276 
Regression: 283-284, 297, 303, 315-316 
  curvilinear, 284 
  linear, 283, 295 
Reliability, sample, 366 
Rounding: 25-27 
  error, 26 
  significant digits, 27 

Sample, Literary Digest, 347 
Sample, size of, 368, 372-374, 377, 379-381, 383 ff 
Sampling: 337 ff 
  advantages of, 335-339 
  design of, 356 
  error, 362, 365, 368 
  philosophy of, 337 
  purpose of, 337 
  traits, 340 
  units, 340 
Sampling distribution: 362 ff, 374, 393 
  of difference, 392, 
  empirical, 365, 374 
  of mean, 364 
  normal, 372, 379 
  of percentage, 378 
  theoretical, 3M 





Index of Subjects (Continued) 


Contingency coefficient, 262 ff 
  maximum, 265 
Correlation: 
  cause and effect, 33S-32" 
  common factors, 327-328 
  interpretation of, 324 ff 
  self-correlation, 328 
  spurious, 328-330 
  types of, 330 ff 
Correlation coefficient: 
  comparison of r and r², 303 
  computation of, 300-311 
  definition of, 307 
  test of significance for, 408-409 
Correlation, measures of: 
  average intercorrelation of ranks, 276 
  coefficient of association (Q), 244 
  coefficient of correlation (r), 305 ff 
  coefficient of determination (r²), 303-304 
  contingency coefficient (C), 201 
  correlation ratio, 320 
  phi coefficient, 252 ff 
  rank-order (ρ), 270-272 
Correlation ratio, 313 ff 
Correlation table, 311-314 
Covariation, 240, 270 
Critical ratio, 323 
Cross-classification: 57 ff 
  norming of, 199-200 
  test of significance for, 40(M0? 
Cumulative frequency distribution, 44-45 
Cumulative frequency polygon (ogive), 85-90 

Decision-making: 213, 233, 361, 388, 394 ff, 412 
  errors in, 390 
Degrees of freedom, 404 
Discrete variable, 14-15, 42 

Error: 
  chance, 22 
  constant, 22 
  in decision-making: 
    Type I and II, 396n 
  grouping, 39, 41 
  in measurement, 19 ff 
  rounding, 26 
  sampling, 362 
Estimation: 
  interval estimate, 366 ff, 378 
  point estimate, 367 
  statistical, 388 
Event (outcome): 206, 209 
  alternative, 217 
  dependent, 218-219 
  factors in, 206 
    chance, 207-209 
    determining, 206-208 
  independent, 218-219 
  joint, 216-217 
  simple, 215-216 
Expected frequency, 186, 263, 406 
Explained variation, 298-299, 303 

Factor: 
  chance, 207 
  determining, 206-208 
Finite population multiplier, 379-381 
Frequency distribution, 31, SO-BS 
Frequency polygon, 82-83 

Graph: 75 ff 
  arithmetic, 76, 98-101 
  bar chart, 92 
  break, 99 
  cumulative frequency polygon (ogive), 86 
  frequency polygon, 82 
  histogram, 75-78 
  semi-logarithmic, 102-106 
Grouping: 34-36 
  error, 39, 41 
  procedure, 289-290 

Histogram, 75-81 
Homoscedasticity, 237 
Hypothesis: 9 
  alternative, 389, 393 
  of homogeneity, 401 
  of independence, 402 
  meaning of, 411 
  null, 388 ff 
  statistical, 411 
  testing of, 390 ff 

Index: 185-187 
  of qualitative variation, 177 ff 
  (See also Norming) 
Inference, statistical: 361 ff, 377 
  estimation, 361 ff, 388 
  hypothesis-testing, 383, 390 ff 



Index of Subjects (Continued) 


Sampling procedures: 343 ff, 356 
  availability, 344, 333 
  biased, 347 
  case study, 345 
  cluster, 352-353, 356, 357 
  destructive, 339 
  haphazard, 343 
  interval, 354 ff 
  judgment, 345 
  multistage, 352 
  non-random, 343 ff 
  random, 343, 348 ff 
  by random digits, 349 
  simple random, 348, 353 
  stratified, 3S6-3SI, 350, 357 
  systematic, see Interval 
Scatter diagram, 331 ff, 306, SIS 
Semi-logarithmic charts, 102-106 
Significance: 
  of difference, 339, 393, 399-400 
  level of, 394, 393, 407, 409 
  ratio, 393 
  test of, 402 
Standard deviate (a-measure), 172-173 
Standard deviation: 164 
  of binomial distribution, 231 
  characteristics of, 167-163 
  computation of, 165-160 
Standard error: 367-363 
  of the difference, 392, 393, 400 
  estimate of, 369 
  of the mean, 363-369, 330 
  of the percentage, 378 
Standard million, 19S 
Standardization: 192-194, 202, 330 
  of mean, 196 
  of rate, 193 
Standardized measures, 127, 174 
Statistic: see Parameter 
Statistical reasoning, 6-9 
Statistics: 
  definition of, 3, 4 
  descriptive, 7, 361 
  etymology of, 6 
  evolution of, 6-8 
  fallacies of, 5 
  validity of, 4-6 

Subclassification, 189-192, 200, 202 
Sufficient reason, principle of, 210 

Tabulation: 33-30, 51-66 
  bivariate, 58 
  construction of, 43, 53 
  exclusiveness, principle of, 51 
  inclusiveness, principle of, 51 
  mixed variables, 51-52 
  multiple entries, 54 
  overlapping categories, 53 
  partial, 200 
  2 x 2 table, 242 
Tree diagram, 224 
Trial, definition of, 214 
Time series: 66 ff 
  cumulated, 72 
  graphs of, 63-101 
  lag in, 316n, 333 

Universe: 337, 339 
  classification of, 340 
  definition of, 339 
  finite, 340 
  heterogeneity of social, 3S2 
  hypothetical, 333 
  infinite, 340 
  instability of social, 331 
  sampled, 341 
  target, 341, 383 

Variable: 
  continuous, 14-15 
  dependent, 281 
  discrete, 14, 43 
  independent, 61, 281 
  intervening, 330 
  qualitative, 12-13, 31, 50, 61 
  quantitative, 12-14, 31 
Variance, 164 
Variate, 12 
Variation (in correlation): 
  explained, 298-299, 303 
  residual, 299-301 
  total, 300, 303 
Variation: 151 ff 
  choice of measure, 174-17S 

Yule’s Q, 242-249 





